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Preface 


Now that half a century has elapsed since the special and general 
theories of relativity were constructed, it is possible to perceive more 
clearly their true significance for the development of mathematical 
physics as a whole. Although these theories possessed a decidedly 
revolutionary appearance when they were announced, it has now be- 
come clear that they represented the natural termination for the classi- 
cal theories of mechanics and electromagnetism, rather than a break 
with these systems of ideas and the inception of a new line of thought. 
In this respect, the relativity theories are to be contrasted with that 
other great achievement of modern theoretical physics, namely 
quantum mechanics, based as this is upon principles which are com- 
pletely at variance with those which were fundamental for Newtonian 
mechanics. That relativity theory has proved to be somewhat sterile 
by comparison with the enormous fertility of the new ideas introduced 
in quantum theory may, perhaps, be partly accounted for by this 
difference. However, the circumstance that the applications of rela- 
tivity theory are chiefly to problems in the fields of astronomy and 
cosmology, fields which until recently have not received great atten- 
tion from physicists due to the difficulty of experimentation, whereas 
quantum mechanics is associated with phenomena of atomic dimen- 
sions more easily investigated in the laboratory, may also have a 
bearing upon the matter. Now that interplanetary and even inter- 
stellar instrumented probes are approaching realization, it may be 
that interest in cosmological problems will be stimulated and that 
significant advances from the position reached by relativity theory 
some decades ago will result. 

However, relativity theory’s present status is as the culmination of 
the classical theories of mechanics and electromagnetism and, since 
much time is normally spent during undergraduate courses giving a 
detailed account of these classical theories, it has always seemed to me 
improper that such courses should be terminated without a descrip- 
tion being given of the very natural and illuminating form into which 
it is possible to cast these theories by use of the device, basic for 
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relativity theory, of mapping events ona four-dimensional space-time 
manifold. Such an introductory course of lectures on relativity theory 
helps to clarify the principles upon which the classical theories are 
based and is far more rewarding from the student’s point of view than 
is a course devoted to the solution of intricate problems of statics and 
dynamics. By some sacrifice of the time usually devoted to conven- 
tional techniques of classical mechanics and electromagnetism, it is 
possible to provide an introductory course of relativity theory suitable 
for students in their third honours year at the university who, judging 
from our experience at Canterbury, are at this stage sufficiently 
mature mathematically to appreciate the great beauty of this system 
of ideas. I have delivered such a course of lectures at Canterbury over 
a number of years and this book has grown out of the lecture notes I 
have prepared. Although there is no shortage of excellent accounts of 
the subject of this book (many are listed in the bibliography at the end) 
the great majority of these are intended for study by post-graduate 
students who are specializing in this or related fields and are not suit- 
able as adjuncts to an introductory course of lectures at the under- 
graduate level. I hope, therefore, that there is a place for this book and 
that university teachers who are responsible for courses of this type 
and their students will find it helpful. 

The plan of the book is as follows: The basic idea of the covariance 
of physical laws with respect to transformations between reference 
frames is introduced in Chapter 1 and is illustrated by showing that 
the laws of Newtonian mechanics are covariant with respect to trans- 
formations between inertial frames. This leads to the special principle 
of relativity and its verification for electrodynamics by the Michelson- 
‘Morley experiment. The details of this crucial experiment are not dis- 
cussed and its aftermath is only briefly alluded to, since I feel that, 
though of great historical interest, now that more than half a century 
has elapsed since it was performed, its significance for the special 
theory, so strongly supported by numerous other experimental results, 
is not now so great as formerly. The remainder of the first chapter is 
devoted to the Lorentz transformation which, from the outset, is 
treated as a rotation of rectangular Cartesian axes in Minkowski 
space-time. I have thought it desirable to relieve the undergraduate 
reader, for whom this book is primarily intended, of the necessity for 
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coping immediately with the full rigours of general tensor calculus and 
I have therefore treated the Minkowski space-time of the special 
theory as a Euclidean manifold, with the formalism of which he will 
already be familiar. The time components of vectors are then, of 
course, purely imaginary, but this is salutary in one respect for it 
serves to emphasize the basic physical distinction which exists between 
space and time measurements and so to check the impression that 
relativity theory implies that space and time are basically of the same 
nature. The theory of tensors relative to rectangular Cartesian co- 
ordinate frames in an N-dimensional Euclidean space is developed in 
Chapter 2 and this is made use of in the succeeding two chapters to 
define the principal vectors and tensors of special relativity physics in 
the Minkowski space-time continuum. All such vectors and tensors 
are denoted by bold roman capital letters and their components by 
the corresponding italic characters with appropriate subscripts (e.g. 
F, F; for the 4-force vector and its components). The corresponding 
3-vectors defined with respect to the rectangular axes employed by an 
inertial observer are denoted by bold roman lower case letters and 
their components by the corresponding italic lower case symbols with 
subscripts (e.g. f, f; for the 3-force vector and its components). All sets 
of transformation equations relating the components of a 3-vector 
relative to different inertial frames are consistently obtained from the 
corresponding 4-vector transformation equations with respect to a 
change of axes in space-time. The reader will accordingly become very 
familiar with this important technique. The laws of mechanics are all 
expressed in covariant four-dimensional form in Chapter 3 and the 
electrodynamic laws are exhibited in this form in Chapter 4. A student 
of physics, for whom the general theory of relativity is of less interest, 
should proceed no further than Chapter 4 except, perhaps, to read 
sections 45 and 46 in which the approach to the general principle is 
outlined. 

The techniques of general tensor calculus, which are employed when 
expounding the general theory of relativity, are explained in Chapter 5. 
The algebra and analysis of tensors and relative tensors are first con- 
structed in an affinely connected space, the special case of a Rieman- 
nian space where the affinity is conveniently related to the metric being 
studied later. The coordinates of a point are here denoted by x’ (rather 
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than x;), for the obvious reason that the dx! are the components of a 
contravariant vector and that I have never found the apologies for 
employing subscripts at all acceptable. I am certain that this practice 
magnifies, quite unnecessarily, the difficulties of the beginner. 

The reason why a theory, which accepts as basic the general prin- 
ciple of relativity, must necessarily be a theory of gravitation is first 
explained in Chapter 6. This leads to Einstein’s law of gravitation, the 
Schwarzschild spherically symmetric metric for empty space with a 
point singularity and the three standard physical tests of the theory. 
It will, perhaps, prevent confusion if it is here remarked that, the 
interval ds between two neighbouring points of space-time is defined 
in such a way that, if (x,y,z) are rectangular Cartesian coordinates 
relative to an inertial frame falling freely in the gravitational field and 
t is time in this frame, then 


ds* = dx*+dy’+dz2*—c* dt’. 


For the mode of development chosen in this book, this proves to be 
the most convenient definition of ds. 

When giving consideration to the manner in which this material 
should be presented to the reader, I have greatly profited by referring 
to the original memoirs of the subject and to the books listed in the 
bibliography. All these works have influenced my own presentation to 
some extent, but I have found the accounts by Einstein, Moller and 
Schrédinger especially stimulating. I am happy to acknowledge my 
indebtedness to the authors of all books which I have consulted. Also, 
tomy colleague Professor W. R. Andress, who has read the manuscript 
and made many valuable suggestions, I tender my grateful thanks. As 
a result of our discussions a number of errors and obscurities have 
been removed. 

Certain of the exercises have been taken from examination 
papers set at the Universities of Cambridge, London and Liverpool. 
These have been indicated as follows: M.T. (Mathematics Tripos), 
L.U. (London) and Li. U. (Liverpool). I am grateful to the author- 
ities concerned for permission to make use of these. 

D. F. LAWDEN 
Mathematics Department University of Canterbury 
Christchurch, N.Z. September 1960 


Preface to the Second Edition 


A small number of typographical errors have been corrected, but 
the chief improvement has been the addition of a large number of 
new exercises, many of which have been taken from examination 
papers set at the University of Canterbury. There are now 114 of 
these exercises and it is hoped that these will provide the student with 
an adequate collection for the purpose of testing his understanding 
of the text. 


D. F. LAWDEN 


Mathematics Department University of Canterbury, Christchurch, 
N.Z. September, 1966. 
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CHAPTER 1 


Special Principle of Relativity. Lorentz 
Transformations 


1. Newton’s laws of motion 

A proper appreciation of the physical content of Newton’s three laws 
of motion is an essential prerequisite for any study of the special 
theory of relativity. It will be shown that these laws are in accordance 
with the fundamental principle upon which the theory is based and 
thus they will also serve as a convenient introduction to this principle. 

The first law states that any particle which is not subjected to forces 
moves along a straight line at constant speed. Since the motion of a 
particle can only be specified relative to some coordinate frame of 
reference, this statement has meaning only when the reference frame 
to be employed when observing the particle’s motion has been indi- 
cated. Also, since the concept of force has not, at this point, received a 
definition, it will be necessary to explain how we are to judge when a 
particle is ‘not subjected to forces’. It will be taken as an observed fact 
that if rectangular axes are taken with their origin at the centre of the 
sun and these axes do not rotate relative to the most distant objects 
known to astronomy, viz. the extra-galactic nebulae, then the motions 
of the neighbouring stars relative to this frame are very nearly uniform. 
The departure from uniformity can reasonably be accounted for as 
due to the influence of the stars upon one another and the evidence 
available suggests very strongly that if the motion of a body in a 
region infinitely remote from all other bodies could be observed, 
then its motion would always prove to be uniform relative to our 
reference frame irrespective of the manner in which the motion was 
initiated. 

Weshall accordingly regard the first law as asserting that, ina region 
of space remote from all other matter and empty save for a single test 
particle, a reference frame can be defined relative to which the particle 
will always have a uniform motion. Such a frame will be referred to as 
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an inertial frame. An example of such an inertial frame which is con- 
veniently employed when discussing the motions of bodies within the 
solar system has been described above. However, if S is any inertial 
frame and S$ is another frame whose axes are always parallel to those of 
S but whose origin moves with a constant velocity u relative to S, then 
S also is inertial. For, if v, ¥ are the velocities of the test particle relative 
to S, § respectively, then 

¥ = v-u (1.1) 
and, since v is always constant, so is ¥. It follows, therefore, that a 
frame whose origin is at the earth’s centre and whose axes do not 
rotate relative to the stars can, for most practical purposes, be looked 
upon as an inertial frame, for the motion of the earth relative to the 
sun is very nearly uniform over periods of time which are normally 
the subject of dynamical calculations. In fact, since the earth’s rota- 
tion is slow by ordinary standards, a frame which is fixed in this body 
can also be treated as approximately inertial and this assumption will 
only lead to appreciable errors when motions over relatively long 
periods of time are being investigated, e.g. Foucault’s pendulum, long 
range gunnery calculations. 

Having established an inertial frame, if it is found by observation 
that a particle does not have a uniform motion relative to the frame, 
the lack of uniformity is attributed to the action of a force which is 
exerted upon the particle by some agency. For example, the orbits of 
the planets are considered to be curved on account of the force of 
gravitational attraction exerted upon these bodies by the sun and when 
a beam of charged particles is observed to be deflected when a bar 
magnet is brought into the vicinity, this phenomenon is understood to 
be due to the magnetic forces which are supposed to act upon the 
particles. If v is the particle’s velocity relative to the frame at any 
instant ft, its acceleration a = dv/dt will be non-zero if the particle’s 
motion is not uniform and this quantity is accordingly a convenient 
measure of the applied force f. We take, therefore, 


faa, 
or f = ma, (1.2) 


where m is a constant of proportionality which depends upon the 
particle and is termed its mass. The definition of the mass of a particle 
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will be given almost immediately when it arises quite naturally out of 
the third law of motion. Equation (1.2) is essentially a definition of 
force relative to an inertial frame and is referred to as the second law 
of motion. It is sometimes convenient to employ a non-inertial frame 
in dynamical calculations, in which case a body which is in uniform 
motion relative to an inertial frame and is therefore subject to no 
forces, will nonetheless have an acceleration in the non-inertial frame. 
By equation (1.2), to this acceleration there corresponds a force, but 
this will not be attributable to any obvious agency and is therefore 
usually referred to as a ‘fictitious’ force. Well-known examples of such 
forces are the centrifugal and Coriolis forces associated with frames 
which are in uniform rotation relative to an inertial frame, e.g. a frame 
rotating with the earth. By introducing such ‘fictitious’ forces, the 
second law of motion becomes applicable in all reference frames. 
According to the third law of motion, when two particles P and Q 
interact so as to influence one another’s motion, the force exerted by P 
on Q is equal to that exerted by Q on P but is in the opposite sense. 
Defining the momentum of a particle relative to a reference frame as 
the product of its mass and its velocity, it is proved in elementary text- 
books that the second and third laws taken together imply that the 
sum of the momenta of any two particles involved in a collision is 
conserved. Thus, if 7;, mz are the masses of two such particles and 
u,, U2 are their respective velocities immediately before the collision 
and vj, V2 are their respective velocities immediately afterwards, then 


m,Wy+m2U = Mm Vv; +22 (1.3) 
° m 
i.e. — (ua —v2) = 4 —-Uy. (1.4) 
m, 


This last equation implies that the vectors u2 — V2, Vi — Wy are parallel, 
a result which has been checked experimentally and which constitutes 
the physical content of the third law. However, equation (1.4) shows 
that the third law is also, in part, a specification of how the mass of a 
particle is to be measured and hence provides a definition for this 
quantity. For 

m, _ |vy— wl (1.5) 

m, |U2—Val 
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and hence the ratio of the masses of two particles can be found from 
the results of a collision experiment. If, then, one particular particle is 
chosen to have unit mass (e.g. the standard gramme, pound, etc.), the 
masses of all other particles can, in principle, be determined by per- 
mitting them to collide with this standard and then employing 
equation (1.5). 


2. Covariance of the laws of motion 

It has been shown in the previous section that the second and third 
laws are essentially definitions of the physical quantities force and 
mass relative to a given reference frame. In this section, we shall 
examine whether these definitions lead to different results when differ- 
ent inertial frames are employed. 

Consider first the definition of mass. If the collision between the 
particles 7m,, mz is observed from the inertial frame S, let i, i, be the 
particle velocities before the collision and ¥,, ¥ the corresponding 
velocities after the collision. By equation (1.1), 


ui; = u,—u, etc. (2.1) 
and hence 
¥,—-U, = V;—-Uy, U2 —V2 = U— V2. (2.2) 


It follows that if the vectors v;—u,, u,—v, are parallel, so are the 
vectors ¥; — ti;, i, —¥2 and consequently that, in so far as the third law 
is experimentally verifiable, it is valid in all inertial frames if it is valid 
in one. Now let 771;, 772 be the particle masses as measured in §. Then, 
by equation (1.5), 


M2 _ Wirt] lvi-ay| mm (2.3) 
™ |iig—¥2|  |ue—vo] omy 


But, if the first particle is the unit standard, then m, =m, = 1 and 
hence 


My = Mh, (2.4) 
i.e. the mass of a particle has the same value in all inertial frames. We 


can express this by saying that mass is an invariant relative to trans- 
formations between inertial frames. 


SPECIAL PRINCIPLE OF RELATIVITY 5 


By differentiating equation (1.1) with respect to the time #, since u 
is constant it is found that 
a=a, (2.5) 


where a, 4 are the accelerations of a particle relative to S, S respec- 
tively. Hence, by the second law (1.2), since m = m, it follows that 


f= f, (2.6) 


i.e. the force acting upon a particle is independent of the inertial 
frame in which it is measured. 

It has therefore been shown that equations (1.2), (1.4) take precisely 
the same form in the two frames, S, S, it being understood that mass, 
acceleration and force are independent of the frame and that velocity 
is transformed in accordance with equation (1.1). When equations 
preserve their form upon transformation from one reference frame to 
another, they are said to be covariant with respect to such a transfor- 
mation. Newton’s laws of motion are covariant with respect to a 
transformation between inertial frames. 


3. Special principle of relativity 
The special principle of relativity asserts that all physical laws are 
covariant with respect to a transformation between inertial frames. This 
implies that all observers moving uniformly relative to one another 
and employing inertial frames will be in agreement concerning the 
statement of physical laws. No such observer, therefore, can regard 
himself as being in a special relationship to the universe not shared by 
any other observer employing an inertial frame; there are no privi- 
leged observers. When man believed himself to be at the centre of 
creation both physically and spiritually, a principle such as that we 
have just enunciated would have been rejected as absurd. However, 
the revolution in attitude to our physical environment initiated by 
Copernicus has proceeded so far that today the principle is accepted 
as eminently reasonable and very strong evidence contradicting the 
principle would have to be discovered to disturb it as a foundation 
upon which theoretical physics is based. 

It has been shown already that Newton’s laws of motion obey the 
principle. Let us now transfer our attention to another set of funda- 
mental laws governing non-mechanical phenomena, viz. Maxwell’s 


6 TENSOR CALCULUS AND RELATIVITY 


laws of electrodynamics. These are more complex than the laws of 
Newton and are most conveniently expressed by the equations 


curlE = sar (3.1) 
1 oE 

curlH = = (4+ 5). (3.2) 
c or 

divE = 47, (3.3) 

divH = 0, (3.4) 


where E, H are the electric and magnetic field intensities respectively, 
jis the current density, p is the charge density and the region of space 
being considered is assumed to be empty save for the presence of the 
electric charge. Units have been taken to be Gaussian, so that c is the 
ratio of the electromagnetic unit of charge to the electrostatic unit 
(= 3x 10!° cm/sec). Experiment confirms that these equations are 
valid when any inertial frame is employed. The most famous such 
experiment was that carried out by Michelson and Morley, who veri- 
fied that the velocity of propagation of light waves in any direction is 
always measured to be c relative to an apparatus stationary on the 
earth. As is well-known, light has an electromagnetic character and 
this result is predicted by the equations (3.1)—(3.4). However, the 
velocity of the earth in its orbit at any time differs from its velocity six 
months later by twice the orbital velocity, viz. 60 km/sec and thus, by 
taking measurements of the velocity of light relative to the earth on 
two days separated by this period of time and showing them to be 
equal, it is possible to confirm that Maxwell’s equations conform to 
the special principle of relativity. This is effectively what Michelson 
and Morley did. However, this interpretation of the results of their 
experiment was not accepted immediately, since it was thought that 
electromagnetic phenomena were supported by a medium called the 
aether and that Maxwell’s equations would prove to be valid only in 
an inertial frame stationary in this medium, i.e. the special principle 
of relativity was denied for electromagnetic phenomena. The con- 
troversy which ensued is of great historical interest, but will not be 
recounted in this book. The special principle is now firmly established 
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and is accepted on the grounds that the conclusions which may be 
deduced from it are everywhere found to be in conformity with experi- 
ment and also because it is felt to possess a priori a high degree of 
plausibility. A description of the steps by which it ultimately came to 
be appreciated that the principle was of quite general application 
would therefore be superfluous in an introductory text. It is, however, 
essential for our future development of the theory to understand the 
prime difficulty preventing an early acceptance of the idea that the 
electromagnetic laws are in conformity with the special principle. 
Consider the two inertial frames S, 5. Suppose that an observer 
employing S measures the velocity of a light pulse and finds it to be c. 
If the velocity of the same light pulse is measured by an observer 
employing the frame 5, let this be ¢. Then, by equation (1.1), 


c¢=c—-u (3.5) 


and it is clear that, in general, the magnitudes of the vectors ¢, ¢ will 
be different. It appears to follow, therefore, that either Maxwell’s 
equations (3.1)-(3.4) must be modified, or the special principle of 
relativity abandoned for electromagnetic phenomena. Attempts were 
made (e.g. by Ritz) to modify Maxwell’s equations, but certain conse- 
quences of the modified equations could not be confirmed experi- 
mentally. Since the special principle was always found to be valid, the 
only remaining alternative was to reject equation (1.1) and to replace 
it by another in conformity with the experimental result that the speed 
of light is the same in all inertial frames. As will be shown in the next 
section, this can only be done at the expense of a radical revision of our 
intuitive ideas concerning the nature of space and time and this was 
very understandably strongly resisted. 


4. Lorentz transformations. Minkowski space-time 

Let the reference frame S comprise rectangular Cartesian axes Oxyz. 
We shall assume that the coordinates of a point relative to this frame 
are measured by the usual procedure and employing a measuring 
scale which is stationary in SS (it is necessary to state this precaution, 
since it will be shown later that the length of a bar is not independent 
of its motion). It will also be supposed that clocks, stationary relative 
to S, are distributed throughout space and are all synchronized with a 
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master clock at O. A satisfactory synchronization procedure would be 
as follows: Warn observers at all clocks that a light source at O will 
commence radiating at t = tg. When an observer at a point P first 
receives light from this source, he is to set the clock at P to read 
to+ OP/c, i.e. it is assumed that light travels with a speed c relative to 
S, as found by experiment. The position and time of an event can now 
be specified relative to S by four coordinates (x, y, z, t), t being the time 
shown on the clock which is contiguous to the event. We shall often 
refer to the four numbers (*, y, z, f) as an event. 

Let Oxjz be rectangular Cartesian axes determining the frame $ 
(to be precise, these are rectangular as seen by an observer stationary 
in S$) and suppose that clocks at rest relative to this frame are synchro- 
nized with a master at O. Any event can now be fixed relative to $ by 
four coordinates (X, ¥, Z,7), the space coordinates being measured by 
scales which are at rest in 5 and the time coordinate by the contiguous 
clock at rest in S. If (x, y,z, t), (%,%, 2,2) relate to the same event, in this 
section we are concerned to find the equations relating these corre- 
sponding coordinates. 

The possibility that the length of a scale and the rate of a clock may 
be affected by uniform motion relative to a reference frame was 
ignored in early physical theories. Velocity measurements were agreed 
to be dependent upon the reference frame, but lengths and time 
measurements were thought to be absolute. We shall make no such 
assumption, but will choose the equations relating the coordinates of 
an event in the two frames to be of such a form that (i) a particle which 
has uniform motion relative to one frame, has uniform motion relative 
to the other and (ii) the velocity of propagation of light is the same 
constant c in both frames. Unless (i) is true, Newton’s first law must 
be abandoned and, with it, the very concept of an inertial frame. 
Experimental results force us to accept (ii). 

To comply with requirement (i), we shall assume that each of the 
coordinates (X, ¥, Z,7) is a linear function of the coordinates (x, y, z, t). 
The inverse relationship is then of the same type. A particle moving 
uniformly in S with velocity (v,,v,,v,) will have space coordinates 
(x,y, Z) such that 


X= Xptv,t, Y=Yotv,t, z=AzHtHv,t. 4) 
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If linear expressions in the coordinates (x, ¥,Z, 7) are now substituted 
for (x, y, Z, t), it will be found on solving for (x, ¥, Z) that these quantities 
are linear in / and hence that the particle’s motion is uniform relative 
to 5. In fact, it may be proved that only a linear transformation can 
satisfy the requirement (i). 

Now suppose that at the instant t = fg a light source situated at the 
point Po (xo,¥0,Zo) in S radiates a pulse of short duration. At any 
later instant t, the wavefront will occupy the sphere whose centre is 
Po and radius c(t— to). This has equation 


(x— x9)? + (y—yo)? + (z—x0)* = C(t)’. (4.2) 
Let (Xo, 9, Zo) be the coordinates of the light source as observed from 
S at the instant 7 = 7p the short pulse is radiated. At any later instant 7, 
in accordance with requirement (ii), the wavefront must also appear 


from S$ to occupy asphere of radius c(f — fp) and centre (Xo, Fp,Z9). This 
has equation 

(% — Xp)? + (G¥—-Fo)* + (Z—Z)” = C7(F—)’. (4.3) 
Equations (4.2), (4.3) describe the same set of events in languages 
appropriate to S, 5 respectively. It follows that the equations relating 
the coordinates (x, y, z, t), (¥, 9, Z,#) must be so chosen that, upon sub- 
stitution for the ‘barred’ quantities appearing in equation (4.3) the 
appropriate linear expressions in the ‘unbarred’ quantities, equation 
(4.2) results. 

A mathematical device due to Minkowski will now be employed. 
We shall replace the time coordinate ¢ of any event observed in S by 
a purely imaginary coordinate x, = ict (i= ~/—1). The space co- 
ordinates (x,y,z) of the event will be replaced by (x;,x>,x3) so that 


X=X1, VHX, Z=X3, ict = X4 (4.4) 


and any event is then determined by four coordinates x; (i = 1,2,3, 4). 
A similar transformation to coordinates X; will be carried out in S. 
Equations (4.2), (4.3) can then be written 


X (x;—x;0)" = 0, (4.5) 


(% —Xjo)? = 0. (4.6) 
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The x; are to be linear functions of the x; and such as to transform 
equation (4.6) into equation (4.5) and hence such that 


4 4 
2 iF) > kB Gi x0). (4.7) 


k can only depend upon the relative velocity of Sand S. Itis reasonable 
to assume that the relationship between the two frames is a reciprocal 
one, so that, when the inverse transformation is made from Sto S, then 


4 4 | 
p> (x}—Xjo)” > k 2 (%;— X40)’ (4.8) 


But the transformation followed by its inverse must leave any function 
of the coordinates %,; unaltered and hence k? = 1. In the limit, as the 
relative motion of S and S is reduced to zero, it is clear that k > +1. 
Hence k 4 — 1 and we conclude that k is identically unity. 

The x; will now be interpreted as rectangular Cartesian coordinates 
in a four-dimensional Euclidean space which we shall refer to as &4. 
This space is termed Minkowski space-time. The left-hand member of 
equation (4.5) is then the square of the ‘distance’ between two points 
having coordinates x;, x;q. It is now clear that the x; can be interpreted 
as the coordinates of the point x; referred to some other rectangular 
Cartesian axes in &,. For such an interpretation will certainly enable 
us to satisfy the requirement (4.7) (with k = 1). Also, the x;, x; will 
then be related by equations of the form 


4 
x; = 2 ay xj+ b;, (4.9) 


where i = 1,2,3,4 and the a;;, b; are constants and this relationship is 
linear. The 5; are the coordinates of the origin of the first set of rect- 
angular axes relative to the second set. The a,; will be shown to satisfy 
certain identities in Chapter 2 (equations (8.14), (8.15)). It is proved 
in algebra texts that the relationship between the x; and X; must be of 
the form we are assuming, if it is (i) linear and (ii) such as to satisfy 
the requirement (4.7). . 

Changing back from the x;, x; to the original coordinates of an 
event by equations (4.4), the equations (4.9) provide a means of re- 
lating space and time measurements in S with the corresponding 
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measurements in 5. Subject to certain provisos (e.g. an event which 
has real coordinates in S, must have real coordinates in $), this trans- 
formation will be referred to as the General Lorentz Transformation. 


5, The special Lorentz transformation 

We shall now investigate the special Lorentz transformation obtained 
by supposing that the X,-axes in &, are obtained from the x,-axes by a 
rotation through an angle « parallel to the x; x4-plane. The origin and 


Xe 


Fic. 1 


the x», x3-axes are unaffected by the rotation and it will be clear after 
consideration of Fig. 1 therefore that 


x, = X,COSa+x4SiN a, Xz = X2, 

X4= -X, sina+x4CoSa, X3 X3- 
Employing equations (4.4), these transformation equations may be 
written 


(5.1) 


X = xcosatictsina, L= | (5.2) 


icf = —xsinaetictcosa, Z = Z. 
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To interpret the equations (5.2), consider a plane which is stationary 
relative to the § frame and has equation 


ax+by+éz+d=0 (5.3) 
for all 7. Its equation relative to the S frame will be 
(a4cos «)x+by+éz+d+ ictasina = 0, (5.4) 


at any fixed instant ¢. In particular, if d= 5 = d=0, this is the co- 
ordinate plane Ox and its equation relative to S is z = 0, i.e. it is the 


Zz Z 


plane Oxy. Again, if b = ¢ = d= 0, the plane is O72 and its equation 
in S is 

xX = —icttana, (5.5) 
i.e. it is a plane parallel to Oyz displaced a distance — icttan« along 
Ox. Finally, if d = € = d= 0, the plane is OZ% and its equation with 
respect to S is y = 0, i.e. it is the plane Ozx. We conclude, therefore, 
that the Lorentz transformation equations (5.2) correspond to the 
particular case when the coordinate planes comprising S are obtained 
from those comprising S at any instant ¢ by a translation along Ox a 
distance — icttan« (Fig. 2). Thus, if u is the speed of translation of § 
relative to S 

u= —ictana. (5.6) 
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It should also be noted that the events 


x=y=z=t=0, xK=py=72=f=0 


correspond and hence that, at the instant O and O coincide, the S and 
S clocks at these points are supposed set to have zero readings; all 
other clocks are then synchronized with these. 

Equation (5.6) indicates that « is imaginary and is directly related 
to the speed of translation. We have tan « = iu/c and hence 


1 (iu/c) 
V/(l—w ey’ V/(1—u?/c’) 


Substituting in the equations (5.2), the special Lorentz transformation 
is obtained in its final form, viz. 


cosa = sina = (5.7) 


__  xaut ee 

x= Vi-w /c*)’ v= ys 2 i; 

j= t—(ux/c’) sen. 
~ Va—wey * ~ * 


If u is small by comparison with c, as is generally the case, these 
equations may evidently be approximated by the equations 


X= x-ut, Ap (5.9) 
z . 


= f, Z= 2Z. 


This set of equations, called the Special Galilean Transformation 
equations, is, of course, the set which was assumed to relate space and 
time measurements in the two frames in classical physical theory. 
However, the equation / = t was rarely stated explicitly, since it was 
taken as self-evident that time measurements were absolute, i.e. quite 
independent of the observer. It appears from equations (5.8) that this 
view of the nature of time can no longer be maintained and that, in 
fact, time and space measurements are related, as is shown by the 
dependence of 7 upon both t and x. This revolutionary idea is also 
suggested by the manner in which the special Lorentz transformation 
has been derived, viz. by a rotation of axes in a manifold which has 
both spacelike and timelike characteristics. However, this does not 
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imply that space and time are now to be regarded as basically similar 
physical quantities, for it has only been possible to place the time co- 
ordinate on the same footing as the space coordinates in &, by multi- 
plying the former by i. Since x4 must always be imaginary, whereas 
X1, X2, X3 are real, the fundamentally different nature of space and 
time measurements is still maintained in the new theory. 

If u > c, both ¥ and 7 as given by equations (5.8) are imaginary. We 
conclude that no observer can possess a velocity greater than that of 
light relative to any other observer. 

If equations (5.8) are solved for (x,y,z, ft) in terms of (X,¥,Z,#), it 
will be found that the inverse transformation is identical with the 
original transformation, except that the sign of u is reversed. This also 
follows from the fact that the inverse transformation corresponds to 
a rotation of axes through an angle — « in space-time. Thus, the frame 
S has velocity —u when observed from 8. 


6. Fitzgerald contraction. Time dilation 

In the next two sections, we shall explore some of the more elementary 

physical consequences of the transformation equations (5.8). 
Consider first a rigid rod stationary in S and lying along the X-axis. 

Let X = X,, ¥ =X, at the two ends of the bar so that its length as 

measured in S is given by 


[= X2—-X4. (6.1) 


At the instant ¢ in S, suppose these ends occupy the positions x = x), 
X =X». Then, by equations (5.8), 


x,—ut :. X2—ut 


VOW]? >? ~ Va wye2) 


But x2 —.x, = lis the length of the bar as measured in S and it follows 
by subtraction of equations (6.3) that 


= 1/(1—u*/c’). (6.3) 


x, = (6.2) 


The length of a bar accordingly suffers contraction when it is moved 
longitudinally relative to an inertial frame. This is the Fitzgerald 
contraction. 
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This contraction is not to be thought of as the physical reaction of 
the rod to its motion and as belonging to the same category of physical 
effects as the contraction of a metal rod when it is cooled. It is due toa 
changed relationship between the rod and the instruments measuring 
its length. is a measurement carried out by scales which are stationary 
relative to the bar, whereas /is the result of a measuring operation with 
scales which are moving with respect to the bar. Also, the first opera- 
tion can be carried out without the assistance of a clock, but the second 
operation involves simultaneous observation of the two ends of the 
bar and hence clocks must be employed. In classical physics, it was 
assumed that these two measurement procedures would yield the same 
result, since it was supposed that a rigid bar possessed intrinsically an 
attribute called its length and that this could in no way be affected by 
the procedure employed to measure it. It is now understood that 
length, like every other physical quantity, is defined by the procedure 
employed for its measurement and that it possesses no meaning apart 
from being the result of this procedure. From this point of view, it is 
not surprising that, when the procedure must be altered to suit the 
circumstances, the result will also be changed. It may assist the reader 
to adopt the modern view of the Fitzgerald contraction if we remark 
that the length of the rod considered above can be altered at any 
instant by simply changing our minds and commencing to employ the 
S frame rather than the 5 frame. Clearly, such a change of mathe- 
matical description can have no physical consequences. 

Now consider two events which have coordinates (x1,¥1,2;,¢), 
(%2,¥2,Z2,t) in S and hence occur simultaneously at different points. 
These events are not necessarily simultaneous in S. For, if #= % at 
the first mentioned event and é = 72 at the second, then 


i, = 33041 —x2)/ VL”) (6.4) 


and 7, ¥ é,, unless x, = x2. The concept of simultaneity is accordingly, 
also, a relative one and has no absolute meaning as was previously 
thought. 

The registration by the clock moving with O of the times é,, #2, con- 
stitutes two events having coordinates (0,0,0,#,), (0,0,0,#2) respec- 
tively in 5. Employing the inverse transformation to (5.8), it follows 
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that the times f,, tf. of these events as measured in S are given by 
t= A/V-w/e), th = b/V(-w/c?), (6.5) 
and hence that 
i-th = (4-h)V(1-w’/c’). (6.6) 


This equation shows that the clock moving with O will appear from S$ 
to have its rate reduced by a factor 4/(1—u?/c”). This is the time 
dilation effect. 

Since any cyclic physical process, i.e. one which returns to some 
initial state after the lapse of a period of time, can be employed as a 
clock, the result just obtained implies that all physical processes will 
evolve more slowly when observed from a frame relative to which 
they are moving. Thus, the rate of decay of radioactive particles pres- 
ent in cosmic rays and moving with high velocities relative to the earth, 
has been observed to be reduced by exactly the factor predicted by 
equation (6.6). 

It may also be deduced that, if a human passenger were to be 
launched from the earth in a rocket which attained a speed approach- 
ing that of light and after proceeding to a great distance returned to 
the earth with the same high speed, suitable observations made from 
the earth would indicate that all physical processes occurring within 
the rocket, including the metabolic and physiological processes taking 
place inside the passenger’s body, would suffer a retardation. Since all 
physical processes would be affected equally, the passenger would be 
unaware of this effect. Nonetheless, upon return to the earth he would 
find that his estimate of the duration of the flight was less than the 
terrestrial estimate. It may be objected that the passenger is entitled to 
regard himself as having been at rest and the earth as having suffered 
the displacement and therefore that the terrestrial estimate should be 
less than his own. This is the clock paradox. The paradox is resolved by 
observing that a frame moving with the rocket is subject to an acceler- 
ation relative to an inertial frame and consequently cannot be treated 
as inertial. The results of special relativity only apply to inertial frames 
and the rocket passenger is accordingly not entitled to make use of 
them in his own frame. As will be shown later, the methods of general 
relativity theory are applicable in any frame and it may be proved 


SPECIAL PRINCIPLE OF RELATIVITY 17 


that, if the passenger employs these methods, his calculations will yield 
results in agreement with those obtained by the terrestrial observer. 


7. Spacelike and timelike intervals. Light cone 
We have proved in section 4 that if x;, xjo are the coordinates in 
Minkowski space-time of two events, then 


4 
2 Gx) (7.1) 


is invariant, i.e. has the same value for all observers employing inertial 
frames and thus rectangular axes in space-time. Reverting by equa- 
tions (4.4) to the ordinary space and time coordinates employed in an 
inertial frame, it follows that 


(x—x0)? + (y—Yo)* + (z— 20)? — 7(t— to)’, (7.2) 
is invariant for all inertial observers. 
Thus, if (x,y,z, t), (xo, ¥o,Z0,%0) are the coordinates of two events 


relative to any inertial frame S and we define the proper time interval r 
between the events by the equation 


7 = (tt)? (x— x0 + (Yo)? +(E— 20) (7.3) 


then 7 is an invariant for the two events. Two observers employing 
different inertial frames may attribute different coordinates to the 
events, but they will be in agreement concerning the value of 7. 

Denoting the time interval between the events by 4tand the distance 
between them by 4d, both relative to the same frame S and positive, 
it follows from equation (7.3) that 


f= AP—4 ad. (7.4) 
Cc 


Suppose that a new inertial frame $ is now defined, moving in the 
direction of the line joining the events with speed 4d/4t. This will only 
be possible if 4d/4t < c. Relative to this frame the events will occur 
at the same point and hence 4d = 0. By equation (7.4), therefore, 


r= AP, (7.5) 
i.e. the proper time interval between two events is the ordinary time 
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interval measured in a frame (if such exists) in which the events occur 

at the same space point. In this case, it is clear that +” > 0 and the 

proper time interval between the events is said to be timelike. 
Suppose, if possible, that a frame 5S can be chosen relative to which 


the events are simultaneous. In this frame At =Oand 
a 
P= Ad’. (7.6) 


Thus 7” < 0, and, in any frame, 4d/At > c. 7 is then purely imaginary 
and the interval between the events is said to be spacelike. 

If the interval is timelike, 4d/4t < c and it is possible for a material 
body to be present at both events. On the other hand, if the interval is 
spacelike, 4d/4t > cand it is not possible for such a body to be present 
at both events. The intermediate case is when 4d/At = c and then 
+ = (0. Only a light pulse can be present at both events. It also follows 
that the proper time interval between the transmission and reception 
of a light signal is zero. 

We shall now represent the event (x,y,z, f) by a point having these 
coordinates in a four-dimensional space. This space is also often 
referred to as Minkowski space-time but, unlike the space-time con- 
tinuum introduced in section 4, it is not Euclidean. However, this 
representation has the advantage that the coordinates all take real 
values and it is therefore more satisfactory when diagrams are to be 
drawn. Suppose a particle is at the origin O of S at t= 0 and com- 
mences to move along Ox with constant speed u. Its y- and z-coordin- 
ates will always be zero and the representation of its motion in 
space-time will be confined to the xf-plane. In this plane, its motion 
will appear as the straight line QP, Q being the pointx = y=z=1=0 
(Fig. 3). OP is called the world-line of the particle. If 2PQt= 0, 
tan 0 = u. But |u| < c and hence the world-line of the particle must lie 
in the sector AQB, where 2 AQB = 2« and tana = c. Similarly, the 
world-line of a particle which arrives at O at t = 0 after moving along 
Ox, must lie in the sector A’ QB’. It follows that any event in either of 
these sectors must be separated from the event Q by a timelike inter- 
val, since a particle can be present at both events. Events in the sectors 
AQB’, A’ OB are separated from Q by spacelike intervals. A’ A, B’B 
are the world-lines of light signals passing through O at t= 0 and 
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being propagated in the directions of the positive and negative x-axis 
respectively. 
For any event in AQB, t > 0, i.e. it is in the future with respect to the 


Absolute 
past 
B 


event Q when the frame S is being employed. However, by no choice 
of frame can it be made simultaneous with Q, for this would imply a 
spacelike interval. A fortiori, in no frame can it occur prior to Q. The 
sector AQB accordingly contains events which are in the absolute 
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future with respect to the event Q. Similarly all events in the sector 
A’ QB’ are in the absolute past with respect to Q. On the other hand, 
events lying in the sectors A QB’, A’ QBareseparated from Q by space- 
like intervals and can all be made simultaneous with Q by proper 
choice of inertial frame. These events may occur before or after Q 
depending upon the frame being used. These two sectors define a 
region of space-time which will be termed the conditional present. 

Since no physical signal can have a speed greater than c, the world- 
line of any such signal emanating from Q must lie in the sector AQB. 
It follows that the event Q can be the physical cause of only those 
events which are in the absolute future with respect to Q. Similarly, Q 
can be the effect of only those events in its absolute past. Q cannot be 
causally related to events in its conditional present. 

This state of affairs should be contrasted with the essentially simpler 
situation of classical physics where there is no upper limit to the signal 
velocity and AA’, BB’ coincide along the x-axis. Past and future are 
then separated by a perfectly precise present in which events all have 
the time coordinate t = 0 for all observers. 

In the four-dimensional space Oxyzt, the three regions of absolute 
past, absolute future and conditional present are separated from one 
another by the hyper-cone 


e+y+227—c ? = 0. (7.7) 


A light pulse transmitted from Q will have its world-line on this sur- 
face, which is accordingly called the light cone at Q. Since any arbitrary 
event can be selected to be Q, any event is the apex of a light cone 
which separates the space-time continuum in an absolute manner into 
three distinct regions relative to the event. 


Exercises 1 
1. A particle of mass m is moving in the plane of axes Oxy under 
the action of a force f. Oxy is an inertial frame. Ox’ y’ is rotating rela- 
tive to the inertial frame so that 7.x’ Ox = %and ¢ = w. (r, 9) are polar 
coordinates of the particle relative to the rotating frame. If (/,, f@) 
are the polar components of f, (@,,ag) are the polar components of the 


SPECIAL PRINCIPLE OF RELATIVITY 21 


particle’s acceleration relative to Ox’ y’, v is the particle’s speed rela- 
tive to this frame and ¢ is the angle its direction of motion makes with 
the radius vector in this frame, obtain the equations of motion in the 
form 

ma, = f,+2muvsin¢ + mrw*, 


mag = fga—2mwvcos — mrad. 


Deduce that the motion relative to the rotating frame is in accordance 
with the second law if, in addition to f, the following forces are also 
taken to act on the particle: (i) mw*r radially outwards (the centrifugal 
force), (ii) 2mwv at right angles to the direction of motion (the Coriolis 
force), (iii) srw transversely. (The latter force vanishes if the rotation 
is uniform.) 

2. A bar lies along Ox and is stationary in §. Show that if the po- 
sitions of its ends are observed in S at instants which are simultaneous 
in S, its length deduced from these observations will be greater than 
its length in § by a factor (1—u*/c*)~ ¥/2, 

3. Suppose that the bar referred to in Exercise 2 takes a time T to 
pass a fixed point on the x-axis, T being measured by aclock stationary 
at the fixed point. Defining the length of the bar in the S-frame to be 
uT, deduce the Fitzgerald contraction. 

4. The measuring rod employed by S will appear from 5 to be 
shortened by a factor (1 — u*/c”)'/?. Hence, when S measures the length 
of the bar fixed in § he might be expected to obtain the result 


L= YaA-w/c*)'. 


This contradicts equation (6.3). Resolve the contradiction. [Hint: It 
will be observed from 5§ that S fixes the position of the forward end of 
the bar first and the position of the rear end a time ul/c? later.] 

5. A bar lies stationary along the x-axis of S. Show that the 
world-lines of the particles of the bar occupy a certain ‘band’ in 
the x; x,-plane. By measuring the width of this ‘band’ parallel to the 
X-axis, deduce the Fitzgerald contraction. 

6. Verify that the transformation equations (5.8) are such that 


PLP POP = eye P—AP, 
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7. Two light pulses are moving in the positive direction along the 
x-axis of the frame S, the distance between them being d. Show that, 
as measured in S, the distance between the pulses is 


c—u 

8. Aand Bare two points of an inertial frame S a distance d apart. 
An event occurs at B a time T (relative to clocks in S) after another 
event occurs at A. Relative to another inertial frame 5, the events are 
simultaneous. If AP is a displacement vector in S representing the 
velocity of 5 relative to S, prove that P lies in a plane perpendicular 
to AB, distance c27T/d from A. 

9. S, § are the inertial frames considered in section 5. The length 
of a moving rod, which remains parallel to the x and X axes, is 
measured as a in the frame S and d in the frame §. By consideration 
of a Minkowski diagram for the rod, or otherwise, show that the 
proper length of the rod is 


aaBu/c 
/ (2Bad— a? — a") 


where 8 = (1 — u2/c2)~1/2, 

10. If the position vectors r = (x,y,z), F = (X, 9,2) of an event as 
determined by the observers in the parallel inertial frames S, $ 
respectively are mapped in the same independent @3, prove that 


i = B(t+uer/c?), 


where 8 = (1 — u2/c2)~1/2 and u is the velocity of S as measured from 


11. The centroid and axis of a right circular cylinder are fixed in 
the inertial frame S relative to which the cylinder rotates about its 
axis with uniform velocity w. Prove that, when observed from S, the 
cylinder will appear to be twisted about its axis through an angle 
uw/c? per unit rest-length of the cylinder. 


CHAPTER 2 


Orthogonal Transformations. Cartesian 
Tensors 


8. Orthogonal transformations 

In section 4 events have been represented by points in a space &,4. The 
resulting distribution of points was described in terms of their co- 
ordinates relative to a set of rectangular Cartesian axes. Each such set 
of axes was shown to correspond to an observer employing a rect- 
angular Cartesian inertial frame in ordinary ¢3-space and clocks 
which are stationary in this frame. In this representation, the descrip- 
tions of physical phenomena given by two such inertial observers are 
related by a transformation in &4 from one set of rectangular axes to 
another. Such a transformation has been given at equation (4.9) and 
is called an orthogonal transformation. In general, if x;, %; @= 1, 
2,...,) are two sets of N quantities which are related by a linear 
transformation 


N 
Hp = Dy ayxjt bi, (8.1) 
j=l 
and, if the coefficients a, of this transformation are such that 
a 2 > 2 
p>) (%i-F) = p> (x:-¥i) (8.2) 


is an identity for all corresponding sets x;, X; and ;, ¥;, then the trans- 
formation is said to be orthogonal. It is clear that the x;, x; may be 
thought of as the coordinates of a point in &y referred to two different 
sets of rectangular Cartesian axes and then equation (8.2) states that 
the square of the distance between two points is an invariant, inde- 
pendent of the Cartesian frame. 

Writing z; = x;—y;, Z; = X;—J;, it follows from equation (8.1) that 


Zz; = > Ay Zj- (8.3) 
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Let z denote the column matrix with elements z;, Z the column matrix 
with elements Z,and A the N x N matrix with elements ay. Then the set 
of equations (8.3) is equivalent to the matrix equation 


Z= Az. (8.4) 
Also, if z’ is the transpose of z, 
N 
2’z= >) 2? (8.5) 


and thus the identity (8.2) may be written 

22 = 2°2. (8.6) 
But, from equation (8.4), 

Z’ = 2’ A’, (8.7) 


Substituting in the left-hand member of equation (8.6) from equations 
(8.4), (8.7), it will be found that 


Zz’ A’ Az = 2'2, (8.8) 
This can only be true for all z if 
AA=I, (8.9) 


where J is the unit N x N matrix. 
Taking determinants of both members of the matrix equation (8.9), 
we find that ||? = 1 and hence 


|A| = +1. (8.10) 


A is accordingly regular. Let A~! be its inverse. Multiplication on the 
right by A~! of both members of equation (8.9) then yields 


A= Ae; (8.11) 
It now follows that 
AA’ = AA =I]. (8.12) 


Let 5,; be the ij*" element of J, so that 


$y = 1, oe 


8.1 
=0, ixj. ol) 
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The symbols 8, are referred to as the Kronecker deltas. Equations 
(8.9), (8.12) are now seen to be equivalent to 


N 

p> Gi Gi = Siks (8.14) 
N 

p> Aji Ay = Sik (8.15) 


respectively. These conditions are necessarily satisfied by the co- 
efficients a,; of the transformation (8.1) if it is orthogonal. Conversely, 
if either of these conditions is satisfied, it is easy to prove that equation 
(8.6) follows and hence that the transformation is orthogonal. 


9. Repeated index summation convention 

At this point it is convenient to introduce a notation which will greatly 
abbreviate future manipulative work. It will be understood that, 
wherever in any term of an expression a literal index occurs twice, this 
term is to be summed over all possible values of the index. For 
example, we shall abbreviate by writing 


N 
xX a,b, = a,6,. (9.1) 
r=1 


The index must be a literal one and we shall further stipulate that it 
must be a small letter. Thus a2 52, ay by are individual terms of the 
expression a,b,, and no summation is intended in these cases. 

Employing this convention, equations (8.14) and (8.15) can be 
written 


Ay Qin. = Sir idk: = Ox (9.2) 
respectively. Again, with z; = x; — y;, equation (8.2) may be written 
2,2; = 25 2Zj. (9.3) 


More than one index may be repeated in the same term, in which 
case more than one summation is intended. Thus 


N WN 
Aj Oj Ch = ~, = aj bj, Cre (9.4) 


It is permissible to replace a repeated index by any other small 
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letter, provided the replacement index does not occur elsewhere in 
the same term. Thus 


a;b; = a,b; _ ay, by; (9.5) 
but 
Ay Ain F Aj Qin, (9.6) 


irrespective of whether the right-hand member is summed with respect 
toj or not. A repeated index shares this property with the variable of 
integration in a definite integral. Thus 


f fix)dx = f poyay OD 


A repeated index is accordingly referred to as a dummy index. Any 
other index will be called a free index. 

It will be assumed, in future, that any equation remains true when 
the free indices assume all possible values. Thus, equations (9.2) are 
true for all j= 1,2,..., Nand all k = 1,2,...,N. It is clear that the free 
indices on the two sides of an equation will be identical. 

The reader should note carefully the identity 


8,4; = aj, (9.8) 


for it will be of frequent application. 5,; is often called a substitution 
operator, since when it multiplies a symbol such as a,, its effect is to 
replace the index j by i. 


10. Rectangular Cartesian tensors 
Let x;, y; be rectangular Cartesian coordinates of two points Q, P 
respectively in &,. Writing z; = x;—y;, the z; are termed the compon- 
ents of the displacement vector PQ relative to the axes being used. If 
X;, J; are the coordinates of Q, P with respect to another set of rect- 
angular axes, the new coordinates will be related to the old by the 
transformation equations (8.1). Then, if Z; are the components of PQ 
in the new frame, it follows (equation (8.3)) that 


2, = ay 2). (10.1) 


Any set of N quantities which take the values A; when the first co- 
ordinate frame is being employed and which transform in the same 


ORTHOGONAL TRANSFORMATIONS 27 


manner as the z; when referred to a new coordinate frame, i.e. are 
such that 
A; = aj Aj, (10.2) 


are said to be the components of a vector in & y relative to rectangular 
Cartesian reference frames. We shall frequently abbreviate ‘the vector 
whose components are A,’ to ‘the vector A,’. We shall also denote the 
vector by A. 

If A;, B; are two vectors, consider the N? quantities A,B,. Upon 
transformation of axes, these quantities transform thus: 


A,B; = 034, 4j1 Ay By. (10.3) 


Any set of N? quantities C, which transform in this manner, i.e. which 
are such that 
Cy = ix yt Crt (10.4) 


are said to be the components of a tensor of the second rank. We shall 
speak of ‘the tensor C,;’. Such a tensor is not, necessarily, represent- 
able as the product of two vectors. 

A set of N? quantities Dj;, which transform in the same manner as 
the product of three vectors A;B;C;,, form a tensor of the third rank. 
The transformation law is 


Dix = Ai Qjm An Dimn- (10.5) 


The generalization to a tensor of any rank should now be obvious. 
Vectors are, of course, tensors of the first rank. 

If Ay, By are tensors, the sums A,+ By are N* quantities which 
transform according to the saine law as the A, and By. The sum of two 
tensors of the second rank is accordingly also a tensor of this rank. 
This result can be generalized immediately to the sum of any two 
tensors of identical rank. Similarly, the difference of two tensors of 
the same rank is also a tensor. 

Our method of introducing a tensor implies that the product of any 
number of vectors is a tensor. Quite generally, if Ajy,,., By... are 
tensors of any ranks (which may be different), then the product 
Ay... But... is a tensor whose rank is the sum of the ranks of the two 
factors. The reader should prove this formally for a product such as 
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Ay Brim, by writing down the transformation equations. (N.B. the 
indices in the two factors must be kept distinct, for otherwise a sum- 
mation is implied and this complicates matters; see section 12.) 
The components of a tensor may be chosen arbitrarily relative to 
any one set of axes. The components of the tensor relative to any other 
set are then fixed by the transformation equations. Consider the tensor 
of the second rank whose components relative to the x;-axes are the 
Kronecker deltas 5y- In the X,-frame, the components are 


by = ik 01 8x1 = Ay, aj = Sis (10.6) 


by equations (9.2). Thus this tensor has the same components relative 
to all sets of axes. It is termed the fundamental tensor of the second 
rank. 

If, to take the particular case of a third rank tensor as an example, 


Aix = Ajix (10.7) 


for all values of i, j, k, Ajj; is said to be symmetric with respect to its 
indices i, 7, Symmetry may be with respect to any pair of indices. If 
Ajj, is a tensor, its property of symmetry with respect to two indices is 
preserved upon transformation, for 


Ajix = Aj Aim akn Amn: 


Aim Gi kn Amin 
= Aix, (10.8) 


where, in the second line, we have rearranged and put Aj,.7, = Amin: 
Unless a property is preserved upon transformation, it will be of little 
importance to us, for we shall later employ tensors to express relation- 
ships which are valid for all observers and a chance relationship, true 
in one frame alone, will be of no fundamental significance. 
Similarly, if 
Aix = — Ajix (10.9) 


for all values of i, j, k, Ajj, is said to be skew-symmetric or anti- 
symmetric with respect to its first two indices. This property also is 
preserved upon transformation. Since A,,, = —Aj41x, 411, = 0. All 
components of Aj; with the first two indices the same are clearly zero. 
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A tensor whose components are all zero in one frame, has zero 
components in every frame. A corollary to this result is that if Aj, _., 
By... are two tensors of the same rank whose corresponding compon- 
ents are equal in one frame, then they are equal in every frame. This 
follows because Ay...—By,,, is a tensor whose components are all 
zero in the first frame and hence in every frame. Thus, a tensor equation 


Ay... = By... (10.10) 


is valid for all choices of axes. This explains the importance of tensors 
for our purpose. By expressing a physical law as a tensor equation, we 
shall guarantee its covariance with respect to a change of inertial 
frame. 


11. Invariants. Gradients. Derivatives of tensors 

Suppose that V is a quantity which is unaffected by any change of 
axes. Then V is called a scalar invariant or simply an invariant. Its 
transformation equation is simply 


V= Vz (11.1) 


As will be proved later (section 21), the charge of an electron is 
independent of the inertial frame from which it is measured and is, 
therefore, the type of quantity we are considering. 

If a value of V is associated with each point of a region of &y, an 
invariant field is defined over this region. In this case V will be a func- 
tion of the coordinates x;. Upon transformation to new axes, V will 
be expressed in terms of the new coordinates X;; when so expressed, it 
is denoted by V. Thus | 


V(X), X2,...,¥n) = V (x1, X25 +++) Xn) (11.2) 


is an identity. The reader should, perhaps, be warned that V is not, 
necessarily, the same function of the x; that Vis of the x;. 

If A,;is a tensor, it is obvious that VA, is also a tensor of the second 
rank. It is therefore convenient to regard an invariant as a tensor of 
zero rank. 

Consider the N partial derivatives 0V/0x;. These transform as a 
vector. To prove this it will be necessary to examine the transformation 
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inverse to (8.1). In the matrix notation of section 8, this may be written 


x = A7!-b) = A’(&—-D), (11.3) 
having made use of equation (8.11). Equation (11.3) is equivalent to 
xi = a;(X;—5;), ad 1.4) 
where di, is the ij" element of A’. But aj; = a;, and hence 
x; = a;i(X; — 5)). (11.5) 
It now follows that 
ox i 
ax, = aj (11.6) 


and hence that 


(11.7) 


proving that 2V/dx; is a vector. It is called the gradient of V and is 
denoted by grad V or VV. 

If a tensor Aj, is defined at every point of some region of &y, the 
result is a tensor field. The partial derivatives 0A, /0x, can now be 
formed and constitute a tensor whose rank is one greater than that of 
A,,.,. We shall prove this for a second rank tensor field A,. The 
argument is easily made general. We have 


ody 


a, ~ Ok (Qj, @js Ars), 


Ox; 


I 


a 
ax, i, Qjs A,;) ax, > 
= bite (11.8) 
J ox, 
by equation (11.6). 
12. Contraction. Scalar product. Divergence 


If two indices are made identical, a summation is implied. Thus, 
consider A,,. Then 


Ay = Ajyyy + Ajno+...+ Ann: (12.1) 
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There are N? quantities Aj. However, of the indices in Aj, only i 
remains free to range over the integers 1,2,..., N, and hence there are 
but N quantities A, and we could put B; = A,;. The rank has been 
reduced by two and the process is accordingly referred to as 
contraction. 

Contraction of a tensor yields another tensor. For example, if 
B; = Ay; then, employing equations (9.2), 


B; = Ay = QigGjr js Agrs = GigSrs Agrs = Gig Agrr = ig B- (12.2) 


Thus B;is a vector. The argument is easily generalized. 

In the special case of a tensor of rank two, e.g. Aj, it follows that 
Aj = Ajj, i.e. Aj; is an invariant. Now, if A;, B; are vectors, A;B; is a 
tensor. Hence, A,B; is an invariant. This contracted product is called 
the inner product or the scalar product of the two vectors. We shall 
write 

A,B; = A-B. (12.3) 


In particular, the scalar product of a vector with itself is an invariant. 
The positive square root of this invariant will be called the magnitude 
of the vector. Thus, if A is the magnitude of A;, then 


A? = A,A; = A-A = A’. (12.4) 
In &3, if 0 is the angle between two vectors A and B, then 
ABcos 0 = A-B. (12.5) 
In &y, this equation is used to define 0. Hence, if 
A-B = 0, (12.6) 


then @ = 47 and the vectors A, B are said to be orthogonal. 

If A; is a vector field, 2A,/2x; is a tensor. By contraction it follows 
that 0A,/2x; is an invariant. This invariant is called the divergence of 
A and is denoted by divA. Thus 


divA = —: (12.7) 


More generally, if Aj... is a tensor field, 2A,___/0x, is a tensor. This 
tensor derivative can now be contracted with respect to the index r 
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and any other index to yield another tensor, e.g. 2Ay__,/@x;. This con- 
traction is also referred to as the divergence of Aj, with respect to 
the index j and we shall write 


OAy... 


= div, Ay... (12.8) 


13. Tensor densities 
W, is a tensor density if, when the coordinates are subjected to the 
transformation (8.1), its components transform according to the law 


Gj = |Alan aya, (13.1) 


|A| being the determinant of the transformation matrix A. Since for 
orthogonal transformations |A| = +1 (equation (8.10)), relative to 
rectangular Cartesian frames, tensors and tensor densities are identi- 
cal except that, for certain changes of axes, all the components of a 
density will be reversed in sign. For example, if in &, a change is made 
from the right-handed system of axes to a left-handed system, the 
determinant of the transformation will be — 1 and the components of 
a density will then be subject to this additional sign change. 

Let e,;..., bea density of the N th rank which is skew-symmetric with 
respect to every pair of indices. Then all its components are zero, 
except those for which the indices i, j, ..., n are all different and forma 
permutation of the numbers 1, 2, ..., N. The effect of transposing any 
pair of indices in ey, is to change its sign. It follows that if the 
arrangement i, j, ..., n can be obtained from 1, 2, ..., N by an even 
number of transpositions, then ej, = +12... , whereas if it can 
be obtained by an odd number e;,_, = —€12...n- Relative to the x;- 
axes let e12...~ = 1. Then, in this frame, ey ,is 0 if i,j, ...,n is nota 
permutation of 1, 2,..., N, is +1 ifitis an even permutation and is — 1 
if it is an odd permutation. Transforming to the X,-axes, we find that 


@12...n = |A]ai;z---4nny,..n = |Al? = 1. (13.2) 


But é,___,, is also skew-symmetric with respect to all its indices, since 
this is a property preserved by the transformation. Its components 
are also 0, +1 therefore and e,___, is a density with the same com- 
ponents in all frames. It is called the Levi-Civita tensor density. 
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It may be shown without difficulty that: 
(i) the sum or difference of two densities of the same rank is a 
density, 
(ii) the product of a tensor and a density is a density, 
(iii) the product of a density and a density is a tensor, 
(iv) the partial derivative of a density with respect to x; is a 
density, 
(v) acontracted density is a density. 
Thus, to prove (iii), let U,;, 8; be two vector densities. Then 


GT, B; = | Al? ajay) By, B, = Qj, 41 Uy B). (13.3) 


The method is clearly quite general. The remaining results will be left 
as exercises for the reader. 


14. Vector products, Curl 
Throughout this section we shall be assuming that N = 3, i.e. the space 
will be ordinary Euclidean space. 

Let A, be a skew-symmetric tensor. Defining 


UW, = sey, Ay, (14.1) 
UW, is a vector density. Its components are 
MW, = 4(A23—A32) = Aas, 
WA, = 4(A43;—Aj3) = Asi, (14.2) 
U3 = $(Aj2— Az) = Ara, 


i.e. are the three distinct non-zero components of A. We have proved, 
therefore, that these three components of any skew-symmetric tensor 
of the second rank may be regarded as the components of a vector 
density. 

It is easy to verify that the inverse relationship to (14.1) is 


Ay = ex Uy. (14.3) 


Let A;, B; be two vectors. From these we can form a skew-symmetric 
tensor of the second rank C; such that 


Cy = A, B;— Aj B;. (14.4) 
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From C;; we can then form a vector density, viz. 
C; = ben Cre = byjx(Aj; Be — Ay By) = Cx ApBy, (14.5) 
whose components are 
©, = A, B;—A; Bo, 
Cc, = A,B,-—A, B;, (14.6) 
C3 = Ay By — Az By. 
Provided we employ only right-handed systems of axes or only left- 
handed systems in &3, ©; is indistinguishable from a vector. If, 
however, achange is made froma left-handed system to a right-handed 
system, or vice versa, the components of ©, are multiplied by —1 in 
addition to the usual vector transformation. Since it is usual to employ 
only right-handed frames, ©; is often referred to as a vector (or an 


axial vector) and treated as such. It is then called the vector product of 
A and B and we write 


© = AXB. (14.7) 
Vector multiplication is non-commutative, for 
BXA = ¢4,B Ay = — Cipj 4, Bi = —AXB, (14.8) 
having made use of e¢;= —¢ij,. However, vector multiplication 
obeys the distributive law, for 


AxX(B+C) Ck Aj(B,+ C,) = Ck A; B+ ik A; C; 


AXB+AXC. (14.9) 


Again, if A; is a vector field, we can construct the skew-symmetric 
tensor of the second rank 


Aj,i— Ai,j- (14.10) 


We have introduced here the abbreviated notation A; ; = 0A,/@x; for 
partial derivatives with respect to the coordinates. This will be made 
use of in many later arguments. Corresponding to Rj, there is the 
vector density R, where 


Ri = bein Rix = byx(An,j—Ajw = yx Any (14.11) 
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This has components 


ea, (14.12) 


and is denoted by curlA. It, also, is an axial vector. 

Equation (14.5) can still be employed to define a vector product 
when either or both of the vectors A, B are replaced by vector densities. 
If only one is replaced by a vector density, the right-hand member of 
equation (14.5) will involve the product of two densities and a vector. 
The resulting vector product will then be a vector. Similarly, by re- 
placing A in equation (14.11) by a vector density, the curl of a vector 
density is defined as an ordinary vector. 


Exercises 2 
1. Show that, in two dimensions, the general orthogonal trans- 
formation has matrix A given by 


ie cos@ siné 
~ \—sin@ cosé 


Verify that |A| = 1 and that A~! = A’. Ty is a tensor in this space. 
Write down in full the transformation equations for all its com- 
ponents and deduce that 7;; is an invariant. 

2. X= Ax, 3 = BX are two successive orthogonal transformations 
relative to each of which 7; transforms as a tensor. Show that the 
resultant transformation ¥ = BAx is orthogonal and that 7), trans- 
forms as a tensor with respect to it. 

3. Show that contraction of the Levi-Civita density results in the 
zero tensor density. 

4. In &3, prove that 


curlgrad VV = 0, divcurlA = 0. 


36 TENSOR CALCULUS AND RELATIVITY 
5. In &3, prove that 
(i) Cit Cimn = 84m 8in— Sn Bis 
(ii) Cit Cikm = 251m 
6. In &3, show that 


2 
V2 V = divgrad V = cataa tag = oY. 
7. In &3, prove that 
curlcurlA = graddivA—V7A. 
(Hint: Employ Exercise 5 (i).] 


8. In &3, prove that 


(i) AxX(BxXC) = A-CB—A-BC, 

(ii) A-BxC=| A; B, C; 
A, Bb CO 
A, B; C; 


9. In &y, prove that 
div VA = VdivA+Acgrad V. 


10. In &3, prove that 
(i) curl VA = VcurlA—Ax grad V, 
(ii) div(AXB) = B-curlA—A-curlB, 
Gii) curl(AXB) = B-VA—A-VB+AdivB— B divA, 
(iv) grad(A-B) = B-VA+A-VB+AxXcurlB+BxcurlA, 
where A-VB = A,B; ;. 


11. If Ay isa tensor and B, = A;;, prove that B, is a tensor. Deduce 
that if A, is symmetric in one frame, it is so in all. 
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12. Prove that 845i = Oy 
and that ej; ¢/mn, has the value + 1 if i, j, k are all different and (/mn) 
is an even permutation of (ijk), —1 if i, j, k are all different and (/mn) 
is an odd permutation of (ijk), and 0 otherwise. Deduce that 
Ck Cimn = 941 8jm Sku Bim Sin Skt 8indj1 8km 
~ Sin Sim 8x1 ae 84 Sin Sim _ Sim 8 Skn- 

Hence prove that 


Ck Cima = Sim Skn > Sin Sime 


(M. T.) 
13. In &3, prove that 
(Gj) (axb)-(cxXd) = arch-d—aedbec, 
(ii) (aXb)X(cxXd) = [acd] b—|[bed]a 
= [abd]c—[abc]d, 
where [abc] = aebxXc. 


CHAPTER 3 


Special Relativity Mechanics 


15. The velocity vector 

Suppose that a point P is in motion relative to an inertial frame S. 
Let ds be the distance between successive positions of P which it 
occupies at times t, t+ dt respectively. Then, by equation (7.4), if dr 
is the proper time interval between these two events, 


1/2 
dr = (a? 2,8?) = (1—v*/c”)!/? dt, (15.1) 
where v = ds/dt is the speed of P as measured in S. Now, as shown in 
section 7, dr is the time interval between the two events as measured 
in a frame for which the events occur at the same point. Thus d7 is 
the time interval measured by a clock moving with P. dt is the time 
interval measured by clocks stationary in S. Equation (15.1) indicates 
that, as observed from S, the rate of the clock moving with P is slow 
by a factor (1—v7/c”)'/?, This is the phenomenon of time dilation 
already commented upon in section 6. If P leaves a point A at t= t; 
and arrives at a point B at t = ft), the time of transit as registered by a 
clock moving with P will be 


ts 
12-71 = i] (1 —v*/c*)"? dt. (15.2) 
ht 


The successive positions of P together with the times it occupies 
these positions constitute a series of events which will lie on the point’s 
world-line in Minkowski space-time. Erecting rectangular axes in 
space-time corresponding to the rectangular Cartesian frame S, let 
x;, x; + dx; be the coordinates of adjacent points on the world-line. 
These points will represent the events (x,y,z, 1), (x+dx,y+dy,z+dz, 
t+dt) in S. If (v,,v,, vz) are the components of the velocity vector v 
of P relative to S, then 

dx dy dz 


gt on ep (15.3) 


Te pe OR oe dt 
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v does not possess the transformation properties of a vector relative 
to orthogonal transformations (i.e. Lorentz transformations) in space- 
time. It is a vector relative to rectangular axes stationary in S only. 
However, we can define a 4-velocity vector which does possess such 
properties as follows: dx; is a displacement vector relative to rect- 
angular axes in space-time and dz is an invariant. It follows that dx;/dr 
is a vector relative to Lorentz transformations expressed as orthogonal 
transformations in space-time. It is called the 4-velocity vector of P 


and will be denoted by V. 
V can be expressed in terms of v thus: 
dx;  dx;dt adie. 
a = ares = (1—v/c*)~ 1/7 x, (15.4) 


by equation (15.1). Also, from equations (4.4) we obtain 
Xy =z, X2= Vy X32 = Uz, Xq = Ic. (15.5) 
It now follows from these equations that 
V = (1-v/c?)—'(v,, 0,, 0,, ic) = (1—v?/c*) "709, ic), (15.6) 


where the notation should be clear without further explanation. 

Knowing the manner in which the components of V transform when 
new axes are chosen in space-time, equation (15.6) enables us to calcu- 
late how the components of y transform when S is replaced by a new 
inertial frame $. Thus, consider the orthogonal transformation (5.1) 
which has been interpreted as a change from an inertial frame S to 
another S$ related to the first as shown in Fig. 2. The corresponding 
transformation equations for V are 


V, = Vi cosa+ V,gsin a, Vz = Vo, (15.7) 
V4 = — V;sine+ V4COs a, V3 = V3. : 
By equation (15.6), these equations are equivalent to 
(1-87 /c?) 2 6, = (1-7/2) "7, cos a+ icsin a), 
(1 -8/c?)— 17 6, = (1 —v"/c*)— "7 v,, 
(15.8) 


(1-8 fc?) 17 6, = (1-v*/c?) "7 v,, 
(1-8 /c?) 1/2 ie = (1 —v?/c*)*!/2(— 0, sin a + iccos a), 
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where ¥ is the velocity of the point as measured in the frame S. Substi- 
tuting for cosa, sin« from equations (5.7), equations (15.8) can be 
written 


Ox = QO(v,,—4), 
By = O(1—w/c)"v,, 
6, = Q(1—w/c?)'? v,, (15.9) 
1 = Q(1 —uv,/c’), 
1—6*/c* 1/2 
where Q= laa | (15.10) 


Dividing the first three equations (15.9) by the fourth, we obtain the 
special Lorentz transformation equations for v in their final form, viz. 


Le yD 
* ~~ 1=uo,/c? 


_ _ d-wiec’)!Pey 


Sar een. (15.11) 
_ — (vc)! v, 
7" 1~uv,/c? 


If u and v are small by comparison with c, equations (15.11) can be 
replaced by the approximate equations 


0, = U,-Uu, Dy, = Vy, GO, = Uz. (15.12) 


These are equivalent to the vector equation (1.1) relating velocity 
measurements in two inertial frames according to Newtonian 
mechanics. 

Since, by the fourth of equations (15.9) Q must be real, equation 
(15.10) implies that if 0 < c then v < c. Thus, if a point is moving with 
a velocity approaching c in S and § is moving relative to S with a 
velocity of the same order, the point’s velocity relative to S will still 
be less than c. Such a result is, of course, completely at variance with 
classical ideas. In particular, if a light pulse is being propagated along 
Ox so that v,=c, v,= v,=0, then it will be found that 6, =c, 
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0, = 0, = 0. This confirms that light is propagated with speed c in all 
inertial frames. 

The transformation inverse to (15.11) can be found by exchanging 

barred’ and ‘unbarred’ velocity components and replacing u by —u. 


16. Mass and momentum 

In section 2 it was shown that Newton’s laws of motion conform to 
the special principle of relativity. However, the argument involved 
classical ideas concerning space-time relationships between two 
inertial frames and these have since been replaced by relationships 
based upon the Lorentz transformation. The whole question must 
therefore be re-examined and this we shall do in this and the following 
section. 

We shall begin by considering the conservation of momentum 
equation (1.3) for the impact of two particles by which mass is defined 
in classical mechanics. Since the velocity vectors u,, etc. are not vec- 
tors relative to orthogonal transformations in space-time and indeed 
transform between inertial frames in a very complex manner, it is at 
once evident that equation (1.3) is not covariant with respect to trans- 
formations between inertial frames. It will accordingly be replaced, 
tentatively, by another equation, viz. 


M,U,;+M2U, = M,V,;+™M2V2, (16.1) 


where Uj, etc. are the 4-velocities of the particles and M,, M) are 
invariants associated with the particles which will correspond to their 
classical masses. This is a vector equation and hence is covariant with 
respect to orthogonal transformations in space-time as we require. 
Equation (16.1) will be abbreviated to the statement 


> MV is conserved (16.2) 
and then, by equation (15.6), this implies that 
> m(vy, ic) is conserved, (16.3) 


M 


where m= davyeyie . 


(16.4) 
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By consideration of the first three (or space) components of (16.3), it 
will be clear that 
. >) mv is conserved, (16.5) 


and, by consideration of the fourth (or time) component that 
> m is conserved. (16.6) 


If, therefore, mm is identified as the quantity which will play the role of 
the Newtonian mass in special relativity mechanics, our tentative 
conservation law (16.1) is seen to incorporate both the principles of 
conservation of momentum and of mass from Newtonian mechanics. 
The principle (16.1) is accordingly eminently reasonable. However, 
our ultimate justification for accepting it is, of course, that its conse- 
quences are verified experimentally. We shall refer to such checks at 
appropriate points in the later development. 

It appears from equation (16.4) that the mass of a particle must now 
be regarded as being dependent upon its speed v. Ifv = 0, then m = M. 
Thus M is the mass of the particle when measured in an inertial frame 
in which it is stationary. M will be referred to as the rest mass or proper 
mass and will, in future, be denoted by mp. Then 


m= aaa (16.7) 


Clearly m— 0 as v—>c, implying that inertia effects become increas- 
ingly serious as the velocity of light is approached and prevent this 
velocity being attained by any material particle. This is in agreement 
with our earlier observations. Formula (16.7) has been verified by 
observation of collisions between atomic nuclei and cosmic ray par- 
ticles (e.g. see Exercise 14 at the end of this chapter). 

We shall define the 4-momentum vector P of a particle whose proper 
mass is 77) and whose 4-velocity is V, by the equation 


P = mV. (16.8) 


Since mm is an invariant and V is a vector in space-time, P is a vector. 
By equation (15.6), 


P = m1 —v*/c*)— 1/2 (vy, ic) = (mv, imc) = (p,imc), (16.9) 
where p = mv is the classical momentum. 
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Relative to the special orthogonal transformation (5.1), the trans- 
formation equations for the components of P are 
P, = P, cosa+P,sin «, P, = P», 
P, = —P, sin a+ P, cos «, P; = P3. 
Substituting for the components of P from equation (16.9) and 
similarly for P, and employing equations (5.7), it will be found that 


(16.10) 


rae Py—mu , 
, = oe (16.11) 
Dz = Pz 

m—pxule” (16.12) 


m= ——° 
(1—u?/c*)!/? 
Equations (16.11) constitute the special Lorentz transformation equa- 
tions for the components of the momentum p and equation (16.12) the 
corresponding transformation equation for mass. Since p, = mv,, 
this equation can also be written 
1—uv |e? 

(—u2/c2)12 m. 
This reduces to the classical form of equation (2.4) if u, v, are neg- 
ligible by comparison with c. 


(16.13) 


nmi= 


17. The force vector. Energy 

We have seen that in classical mechanics, when the mass of a particle 
has been determined, the force acting upon it at any instant is specified 
by Newton’s second law. Force receives a similar definition in special 
relativity mechanics. The mass of a particle with a given velocity can 
be determined by permitting it to collide with a standard particle and 
applying the principle of momentum conservation. Equation (16.7) 
then gives its mass at any velocity. The force f acting upon a particle 
having mass m and velocity v relative to some inertial frame is then 
defined by the equation, 


ye 
f= Fm) =~, (17.1) 


44 TENSOR CALCULUS AND RELATIVITY 


where p is the particle’s momentum. Clearly f will be dependent upon 
the inertial frame employed, a departure from classical mechanics. 

Definition (17.1) implies that, if equal and opposite forces act upon 
two colliding particles, momentum is conserved. However, although 
experiment confirms that momentum is indeed conserved, Newton’s 
third law cannot be incorporated in the new mechanics, for it will 
appear later that, if the forces are equal and opposite for one inertial 
observer, in general they are not so for all such observers. Equation 
(16.1) therefore replaces this law in the new mechanics. 

f is not a vector with respect to Lorentz transformations in space- 
time. However, a 4-force F can be defined which has this property. 
The natural definition is clearly 


(17.2) 


P being the 4-momentum and 7 the proper time for the particle. F is 
immediately expressible in terms of f for, by equation (16.9), 


F= <0, ime), 


dad, . dt 
= 5 Pe ime) > 
= (1—v*/c?)~ 1, ime), 


= (1—v*/c*)— "2 (£, ime). (17.3) 


The vectors V, F are orthogonal. This is proved as follows: From 
equation (15.6) 


V2 = —c’, (17.4) 
Differentiating with respect to 7, 
dV 
ae — 0, 
i.e. V-F = 0, (17.5) 


as stated. This result has very important consequences. Substituting 
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for V and F from equations (15.6) and (17.3) respectively, it is clear 
that 


(1—w7/c?)~!(v, ic), imc) = 0. (17.6) 


This is equivalent to 
v-f—c*?m = 0. (17.7) 


But, by definition, v-f is the rate at which f is doing work. It follows 
that the work done by the force acting on the particle during a time 
interval (t;, t2) is 


ts 
[ ema = m,c*?—m,c*. (17.8) 
th 


The classical equation of work is 
work done = increase in kinetic energy, (17.9) 


where T = 4nv” is the kinetic energy. Equation (17.8) indicates that 
in special relativity mechanics we must define T by a formula of the 
type 

T = mc* +constant. (17.10) 


When v = 0, T= 0 and this determines the unknown constant to be 
— mpc’. Thus 


2 
T = ayaa (17.11) 


If v/c is small, (1 —v?/c?)~'/2 = 1+07/2c* approximately and the 
above equation reduces to T= 47mpv", in agreement with classical 
theory. 

According to equation (17.10), any increase in the kinetic energy 
of a particle will result in a proportional increase in its mass. Thus, if 
a body is heated so that the thermal agitation of its molecules is 
increased, the masses of these particles, and hence the total body mass, 
will increase in proportion to the heat energy which has been 
communicated. 

Again, suppose two equal elastic particles approach one another 
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along the same straight line with equal speeds v. If their proper masses 
are both mo, the net mass in the system before collision is 


2mo/(1 —v*/c*)'/?, 


It has been accepted as a fundamental principle that this mass will 
be conserved during the collision. However, from considerations of 
symmetry, it is obvious that at some instant during the impact both 
particles will be brought to rest and their masses at this instant will 
be proper masses m,. By our principle, 


pig Sian LOO. 

mo = (l—v/c2)'2 

It follows, therefore, that, at this instant, the proper mass of each 
particle has increased by 


(17.12) 


mo 
Gay Sia” = T/c’, (17.13) 
where T is the original KE of the particle and use has been made of 
equation (17.11). Now, in losing this KE, the particle has had an equal 
amount of work done upon it by the force of interaction and this has 
resulted in a distortion of the elastic material of which it is made. 
At the instant each particle is brought to rest, this distortion is at a 
maximum and the elastic potential energy as measured by the work 
done will be exactly 7. If we assume that this increase in the internal 
energy of the particle leads to a proportional increase in mass, the 
increment of rest mass (17.13) is explained. If the particles are not 
perfectly elastic, the work done in bringing them to rest will not only 
increase the internal elastic energy, but will also generate heat. Both 
forms of energy will then contribute to increase the proper masses. 
Such considerations as these suggest very strongly that mass and 
energy are equivalent, being two different measures of the same 
physical quantity. Thus, the distinction between mass and energy 
which was maintained in classical physical theories, has now been 
abandoned. All forms of energy £, mechanical, thermal, electromag- 
netic, are now taken to possess inertia of mass m, according to 
Einstein’s Equation, viz. 
E = me’. (17.14) 
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Conversely, any particle whose mass is m, has associated energy E 
and, by equation (17.11), 


E=T+mc’. (17.15) 


mo’ is interpreted as the internal energy of the particle when station- 
ary. If the particle were converted completely into electro-magnetic 
radiation, moc” would be the energy released. This is the source of the 
energy released in an atomic explosion. The mass of the material 
products of the explosion is slightly less than the net mass present 
before the explosion, the difference being accounted for by the mass 
of the energy released. Even a small mass deficiency implies that an 
immense quantity of energy has been released. Thus, if m=1 gm, 
c= 3x10! cm/sec and hence E= 9 x 107° ergs = 2:5 x 10’ kilowatt 
hours. 

The principle of conservation of mass, which has been incorporated 
into the new mechanics, is now seen to be identical with the principle 
of conservation of energy, which is accordingly also regarded as valid 
in the new mechanics. However, the distinction between the two 
principles, which was a feature of the older mechanics, has 
disappeared. 


18. Lorentz transformation equations for force 
By equation (17.7), 


the = “fey, (18.1) 


Referring to equation (17.3), F can now be completely expressed in 
terms of f thus: 


F= a0 /e)-1"( 2): (18.2) 


Relative to the special Lorentz transformation, the transformation 
equations for the components of F are 


F, = F,cosa+ F4sin «, F, = oe 


18.3 
F, = —F,sine+F,cosa, F; = F3. ve: 
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Substituting from equation (18.2) into the first three of these equations 
and employing equations (5.7), it follows that 


Fe = o[f.- Zt), 
fy = Q1-w*/c?)"? fF, 
f, = Q1-w/c’)'? Ff, 


where Q is given by equation (15.10). Substituting for. Q from the 
fourth of equations (15.9), it will be found that 


(18.4) 


w= ge lh (fy vy +f, 02) 
Be Fe 2 wo ft 


1—w?c2)!? 
f, = atte 73 ie (18.5) 
pd (1—w?/c*)'? 
7 1=uv,/c? “* 


These are the special Lorentz transformation equations for f. If u, v 
are negligible by comparison with c, these equations reduce to the 
classical form of equation (2.6). 

It is clear from equations (18.5) that, if equal and opposite forces 
are observed from S to act upon two particles, the forces observed 
from § will not be so related unless the particles’ velocities are the 
same. 


19. Motion with variable proper mass 
In section 17 it has been assumed that the proper mass mpg of the 
particle which is moving under the action of the force f, is constant 
throughout the motion. If, however, the particle is being heated or 
cooled during its motion, or if any non-mechanical forms of energy 
are being communicated to it from an external source, its proper mass 
will vary and our equations must be modified to take account of this 
variation. 

Thus, consider equation (17.2). The 4-momentum P is still defined 
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by equation (16.8) but, since mg is variable, the 4-force is given by 


d dV ._ dm 
F = (mV) = mT tv (19.1 

Equation (17.4) remains valid and hence, differentiating, 

dV 

——— = ° 19.2 

dr ( ) 

Substituting for dV/dr from equation (19.1), we obtain 
di 
V-F= —-c—. (19.3) 
dr 


We conclude that V and F are no longer orthogonal vectors. Substi- 
tuting in equation (19.3) for V and F from equations (15.6) and (17.3) 
respectively, it will be found that 

a f-v+(c*-—v*) a (19.4) 
This is the modified equation of work. Its physical interpretation is 
clearly: 


rate of increase in particle’s energy 
= rate of doing work by applied force 
+rate of energy input from the external source. (19.5) 


We deduce that energy is being taken from the external source at a 
rate 
R= (c —v~)— (19.6) 
dr 
as measured in the inertial frame being employed. 
Equation (19.4) can therefore be written 


E=cCm=f-v+R ; (19.7) 
and hence, by equation (17.3), 
F = (1—v2/c*)—1? & (f-v+ n)|- (19.8) 


This is the modified form of equation (18.2). 


50 TENSOR CALCULUS AND RELATIVITY 
20. Lagrange’s and Hamilton’s equations 
Suppose that a particle having constant proper mass mp is in motion 


relative to an inertial frame under the action of a force derivable from 
a potential V. Then its equations of motion are 


d moX oVv 
—(>—5-5 75) = ~—, ete. 20. 
An —an} ax? Oy 
Expressed in Lagrange form, these equations must be 

d (aL aL 

—{—}) = —, etc. 20.2 

alas) a0.) 
and hence Z must be a function of x, y, z, x, ¥, Z, such that 


aL mo x a | OV 
a a ON ne, a ie. 20.3 
ax (1—v2/c?)"/?” ax ax a) 


Since v? = x7 + 7+ 27, these equations can be validated by taking 
L = —moc*(1—v?/c?)!? - V, (20.4) 


which is accordingly the Lagrangian for the particle. 

Now 
aL 
= = Px, ete. (20.5) 
ox 
and it follows exactly as in classical theory that, if the Hamiltonian H 
is defined by the equation 


H = p,vx+pyvyt+p,v,—L, (20.6) 


and is then expressed as a function of the quantities x, y, Z, Dx, Dy, Pz 
alone, the Lagrange equations (20.2) are equivalent to Hamilton’s 
equations 


oH oH 
t= SS, Pye ea ete. (20.7) 
op. ox 


Now 


mo vu 


PxyVUxtPyVy+Ppz,0, = (l—v?/c2)'? (20.8) 
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and hence 
mp c2 
= G-vyayint 
= E+V (20.9) 
the total energy, precisely as for classical theory. 
But 
2 a 
PEt PS +P: = ieee’ 
mic? 
= —mpel+ 1—-v2/c? 
= —mic*+E*/c*, (20.10) 
and it follows that 
E? = c(p2+p2+p2+ mc’). (20.11) 
Substituting in equation (20.9) 
H = c(p2+pt+p2+mgc’)'? + V, (20.12) 


expressing H as a function of x, y, Z, Px» Py» Pz- The reader is now left 
to verify that Hamilton’s equations are equivalent to the equations 
of motion (20.1). 


. Exercises 3 
1. Obtain the transformation equations for v by differentiating the 
Lorentz transformation. 
2. Obtain the transformation equations for the acceleration a by 
differentiating the transformation equations for v and express them 
in the form 


_ _d-#ley? | 
“x Ge,ulep” 
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tae? v, ule? 
ay = D2 \ Fy ay}, 
(1—v,, u/c?) 1—v,.u/c? 


: 1—u/?c? (< v, u/c ) 


“2 —e,uleyp\** 1—o,ue™ 


Deduce that a point which has uniform acceleration in one inertial 
frame has not, in general, uniform acceleration in another. 

3. If 5 has velocity c relative to S, show that all points moving 
relative to S with velocities less than c have a velocity c as observed 
from S$. 

4. Two points are moving in opposite directions with speeds c 
relative to some inertial frame. Show that their relative velocity is c. 

5. Show that the 4-velocity V is of constant magnitude ic. 

6. A beam of light is being propagated in the xy-plane of S at an 
angle « to the x-axis. Relative to S it is observed to make an angle « 
with Ox. Prove the aberration of light formula, viz. 


see Sis cot a — (u/c) COSEC a 
oa ee? 
Deduce that, if u<c, then 
o u., 
Ma = é—a = -sinag, 
c 


approximately. 
7. A particle of proper mass mg is moving under the action of a 
force f with velocity v. Show that 


___™%_ dv, _movd/c* 
d= 8c)? ae” 07/2” 


Hence, if the acceleration dv/dt is parallel to v, show that 


f 


pg a 
(1—v*/c?)?? dt’ 


and if the acceleration is perpendicular to v, then 


dv 


eal ee 
(1—v*/c*)"? dt 
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8. Show that P = (p,iE/c) and deduce that 


p—E*|c” 


is an invariant — m%c* with respect to Lorentz transformations. 
9. Show that E transforms under a special Lorentz transformation 
according to the equation 


E-p,u 
Fos es Ss 
(1—u2/c2) 12 

10. A rocket moves along the x-axis in S, commencing its motion 
with velocity vp and ending it with velocity v,. If w is the jet velocity 
as measured by the crew (assumed constant), show that the mass ratio 
of the manoeuvre (i.e. initial mass/final mass) as measured by the crew 
is 


E +0) (— cay" sie 
(c—v1) (c+) 


What does this reduce to as c—> 0? Deduce that, if the rocket starts 
from rest in S and its jet is a stream of photons, the mass ratio to 


velocity v is 
c+v : 
c—u 


Show that, with a mass ratio of 6, the rocket can attain 35/37 of the 
velocity of light in S. 

11. S, 5, § are inertial frames with their axes parallel. S$ has a 
velocity u relative to S and 5 has a velocity v relative to S, both velo- 
cities being parallel to the x-axes. If transformation from S$ to $ 
involves a rotation through an angle a of the axes in space-time and 
transformation from S to § a rotation f, a transformation from S to 
S involves a rotation y where y = «+8. Deduce from this equation 
the relativistic law for the composition of velocities, viz. 


u+UVU 


w= ——' 
1 +uv/c? 
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12. A force f acts upon a particle of mass m whose velocity is y. 
Show that 


13. An electrified particle having charge e and rest mass my Moves 
in a uniform electric field of intensity E parallel to the x-axis. If it is 
initially at rest at the origin, show that it moves along the x-axis so 


that at time t 
fly e$e)-| 


where k = eE/mo. Show that this motion approaches that predicted 
by classical mechanics as c-> oo. [It may be assumed that the force 
acting upon the particle is eF in the direction of the field at all times.] 

14. A particle is moving with velocity u when it collides with a 
stationary particle having the same rest mass. After the collision the 
particles are moving at angles 0, ¢ with the direction of motion of the 
first particle before collision. Show that 


tan #tangd = =e 
y+1 
where y = (1—u?/c?)—/?, (if c-> , y->1 and 0+ = 4m. This is the 
prediction of classical mechanics. However, if the particles are elec- 
trons and u is near to c in value, 0+¢ < 41. This effect has been 
observed in a Wilson cloud chamber.) [Hint: Refer the collision to an 
inertial frame in which both particles have equal and opposite 
velocities prior to coHision.] 
15. A body of mass M disintegrates while at rest into two parts of 
rest masses M, and M2. Show that the energies FE, E> of the parts are 
given by 


M? + _M?~ M3 M?—M?+ M3 
——_ 2?) ob = AP. 


na a 2M : 2M 


16. A particle of proper mass 7g moves under the action of a central 
force. (r,@) are its polar coordinates in its plane of motion relative to 
the force centre as pole. V(r) is its potential energy when at a distance r 
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from the centre. Obtain Lagrange’s equations for the motion in the 
form 


d 1 d 
— r)— 62 —Vy’=0 — 2 6 — 0, 
dt DEVE mo ie ” dt oF) 


where y = [1 — (#7 +r767)/c?]~ '/*. Write down the energy equation for 
the motion and obtain the differential equation for the orbit in the 
form 


du C-V 
2 , 
ates eagpeeeans / 
h? ( 7162 +u) me 
where u = 1/r and h, C are constants. In the inverse square law case 
when V= —p/r, deduce that the polar equation of the orbit can be 
written 
lu = 1+ecos7, 


where 7? = 1—p?/mih* c*. If w/mohc is small, show that the orbit is 
approximately an ellipse whose major axis rotates through an angle 
arp?/me hc? per revolution. 

17. A photon having energy E collides with a stationary electron 
whose rest mass is mo. As a result of the collision the direction of the 
photon’s motion is deflected through an angle 6 and its energy is 
reduced to E’. Prove that 


1 1 
———]} = 1-—cos0. 
mo c* ( rz =) 1—cos 
(it may be assumed that the momentum of a photon having energy E 
is E/c.) 

Deduce that the wavelength A of the photon is increased by 


Ata sine: 
moc 


where A is Planck’s constant. (This is the Compton Effect. For a 
photon, take A = hc/E.) 

18. A particle of rest mass m, and speed v collides with a particle 
of rest mass mz which is stationary. After collision the two particles 
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coalesce. Assuming that there is no radiation of energy, show that 
the rest mass of the combined particle is M, where 


2m, m2 
M 2= me + m3 + (1—02/c2)1/2 
and find its speed. 

19. A luminous disc of radius a has its centre fixed at the point 
(x,0,0) of the S-frame and its plane is perpendicular to the x-axis. It 
is observed from the origin in the S-frame at the instant the origins 
of the two frames coincide and is measured to subtend an angle 2a. 
Prove that, if a < x, then 


a j{ct+u 
al ia) 
XAj \c—u 


(Hint: employ the aberration of light formula, exercise 6 above.) 

20. Two particles having proper masses ™,, mz are moving with 
velocities u,, u2 respectively, when they collide and cohere. If « is the 
angle between their lines of motion before collision, show that the 
proper mass of the combined particle is m, where 


2m, m(c? — Uy U2 COS ac) ; 
V {(c?— uf) (c? — u9)} 


Show that, for all values of «, m > m+ mz and explain the increase 
in proper mass. 

21. v, ¥ are the velocities of a point relative to the inertial frames 
S, S respectively. Representing these vectors as position vectors in an 
independent &3, show that 


m2 = mi+mz+ 


pi =O fren e- v+el| 
where f = (1 —u2/c2)~1/2 and 
Q = 1/(l+uev/c?). 
Show further that 
u2 By = Q[(i—P)ux(vxu)+ fu2({u+ y)], 
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and hence verify that 


o? = Q%{(u+v)?—(v x u)?/c?]. 


22. A particle moves along the x-axis of the frame S with velocity 
v and acceleration a. Show that the particle’s acceleration in S is 


(1 — u2/c2)3/2 


oop“ 


If the particle always has constant acceleration a relative to an 
inertial frame in which it is instantaneously at rest, prove that 


Sb) =, 


where B = (1 —v2/c2)~1/2 and t is time in S. 
Assuming that the particle is at rest at the origin of S at t=0, 
show that its x-coordinate at time ¢ is given by 


ax = c2[(1 + a2 t2/c2)/2— 1]. 


23. Three rectangular cartesian inertial frames S, 5, S are initially 
coincident. As seen from S, S moves with velocity u parallel to Ox 
and, as seen from §, § moves with velocity vw parallel to OF. If the 
direction of §’s motion as seen from S' makes an angle 6 with Ox and 
the direction of S’s motion as seen from 5 makes an angle ¢ with OR, 
prove that 


Uv uz \1/2 Uv y2\71/2 
tanOd=-{1-—]} , tand = -{1-— ‘ 
( “) b °( =) 


Deduce that, if u, v <c, then 
$—0 = uv/2c? 


approximately. 

24. A particle is moving with velocity v when it disintegrates into 
two photons having energies E,, E,, moving in directions making 
angles «, 8 with the original direction of motion and on opposite 
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sides of this direction. Show that 
cC—WvU 


tan 4a tan 4B = oie 7 


Deduce that, if a photon disintegrates into two photons, they must 
both move in the same direction as the original photon. (The momen- 
tum of a photon having energy E is E/c.) 


CHAPTER 4 
Special Relativity Electrodynamics 


21. 4-Current density 

In this chapter we shall study the electromagnetic field due to a flow 
of charge which will be assumed known. Relative to an inertial frame 
S, let p be the charge density and yv its velocity of flow. Then, if j is 
the current density, 


j = pv. (21.1) 


Assuming that charge can neither be created nor destroyed, the 
equation of continuity 


divj+— =0 (21.2) 


will be valid for the charge flow in S. This equation must be valid in 
every inertial frame and hence must be expressible in a form which is 
covariant with respect to orthogonal transformations in space-time. 
Introducing the coordinates x; by equations (4.4) and employing 
equation (21.1), equation (21.2) is seen to be equivalent to 


ie Ont = (pe) +5 = po) + a = (ep) = (21.3) 


This equation is covariant as required if (pv,, pvy, pz, icp) are the 
four components of a vector in space-time. For, if J is this vector, 
equation (21.3) can be written 

Ji; = 0, (21.4) 


and this is covariant with respect to orthogonal transformations. 
‘Now, by equation (15.6), 


J = (pv, icp) = p(l—v?/c?)'/? Vv, (21.5) 
[59] 
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where V is the 4-velocity of flow and hence J is a vector if p(1 — v?/c?)!/2 
is an invariant. Denoting the invariant by po, we have 


Pp = aaa (21.6) 


It follows that p = po if v = 0 and hence that pp is the charge density 
as measured from an inertial frame relative to which the charge being 
considered is instantaneously at rest. pg is called the proper charge 
density. 

J is called the 4-current density and it is clear from equation (21.5) 
that 

J = poV = Gi, icp). (21.7) 
It is now clear that, when J has been specified throughout space-time, 
the charge flow is completely determined, for the space components 
of J fix the current density and the time component fixes the charge 
density. Hence, given J, the electromagnetic field must be calculable. 
The equations which form the basis for this calculation will be derived 
in the next two sections. 

Let dwo be the volume of a small element of charge as measured 
from an inertial frame Sp relative to which the charge is instantane- 
ously at rest. The total charge within the element is ppdwo. Due to the 
Fitzgerald contraction, the volume of this element as measured from 
S will be dw, where 


dw = (1—v*/c*)"/? dag. (21.8) 
The total charge within the element as measured from 5S is therefore 
pdw = p(1—v?/c?)"!? day = po du, (21.9) 


by equation (21.6). It follows that the electric charge on a body is 
invariant for all inertial observers. 


22. 4-Vector potential 

In classical theory, the equations determining the electromagnetic 
field due to a given charge flow are Maxwell’s equations (3.1)—(3.4). 
To ensure covariance of the laws of mechanics with respect to Lorentz 
transformations, it proved necessary to modify classical Newtonian 
theory slightly. However, it will be shown that Maxwell’s equations 
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are covariant without any adjustment being necessary. Indeed, the 
Lorentz transformation equations were first noticed as the transfor- 
mation equations which leave Maxwell’s equations unaltered in form. 
To prove this, it will be convenient to introduce the scalar and vector 
potentials, ¢ and A respectively, of the field. It is proved in textbooks 
devoted to the classical theoryf that A satisfies the equations 


7} 
diva +1 = 0, (22.1) 
c ot 
128A 4rr 
VA-+—> = -— ij, 22. 
c? at? a a) 
and ¢ satisfies the equation 
12¢ 
2 ROP 
V?b—3 53 = ~4np. (22.3) 


We now define a 4-vector potential 9 in any inertial frame S by the 
equation 

Q = (A, i¢). (22.4) 
It is easily verified that equations (22.2), (22.3) are together equivalent 
to the equation 


4 
O72 = - = J, (22.5) 
where the operator [7 is defined by 
ee (22.6) 
~ axt” ax3' ax3° ax} 


The space components of equation (22.5) yield equation (22.2) and 
the time component, equation (22.3). If 2;, J; are the components of 
S2 and J respectively, equation (22.5) can be written 


4r 
Q, 4; = - = Ji (22.7) 


in which form it is clearly covariant with respect to Lorentz transfor- 
mations provided & is a vector. This confirms that equation (22.4) 


t See, e.g., A Course in Applied Mathematics by D. F. Lawden, p. 527. 
English Universities Press. 
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does, in fact, define a quantity with the transformation properties of 
a vector in space-time. 

Next, it is necessary to show that equation (22.1) is also covariant 
with respect to orthogonal transformations in space-time. It is clearly 
equivalent to the equation 


div 2 = 2; ; = 0, (22.8) 


which is in the required form. 
J being given, 92 is now determined by equations (22.7) and (22.8). 


23. The field tensor 
When A and ¢ are known in an inertial frame, the electric and magnetic 
intensities E and H respectively at any point in the electromagnetic 
field follow from the equations 

102A 


=o 


7, (23.1) 


H = curlA. (23.2) 


Making use of equations (4.4) and (22.4), these equations are easily 
shown to be equivalent to the set 


02, @Q,; 
ox 1 Ox, ? 
aQ, aQ, 
x2 Ox4 ; 
a2, aQ 
—fF a 


0x3 Ox4 ) 


_ @Q; AQ, 
- Ox2 0x3 : 


~iE, = 


(23.3) 


= I. 3, (23.4) 


See ee 
. 
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Equations (23.3), (23.4) indicate that the six components of the vec- 
tors — iE, H with respect to the rectangular Cartesian inertial frame S 
are the six distinct non-zero components in space-time of the skew- 
symmetric tensor {2; ;— 2 ;. We have proved, therefore, that equa- 
tions (23.1), (23.2) are valid in all inertial frames if 


0 4H, —-H, —iE, 
-H, 0 4H, —iE, 
H, —H, 0 —iE, 
iE, iE, iE, 0 


(Fy) = (23.5) 


is assumed to transform as a tensor with respect to orthogonal trans- 
formations in space-time. The equations (23.3), (23.4) can then be 
summarized in the tensor equation 


Fy = 2, ,— Qi ;. (23.6) 


Fi, is called the electromagnetic field tensor. The close relationship 
between the electric and magnetic aspects of an electromagnetic field 
is now revealed as being due to their both contributing as components 
to the field tensor which serves to unite them. 

Consider now equations (3.2), (3.3). Employing the field tensor 
defined by equation (23.5) and the current density given by equation 
(21.7), these equations are seen to be equivalent to 

OF 2 , Fis , Fig _ 4 


—J 
Ox» 0x3 Ox4 Cc. - 


OF. OF- OF. 4 
Met, oe os =p, 


Ox, 9x3 0x4 
OF 3, OF 32 OF 34 ™ 4r 
Ox 1 ax2 Ox4 c 
OF4, OF 42 OF. ar 
OF 41, Fan | O43 _ 

Ox, 9X2 Ox; 


(23.7) 


or, in short, 


4n 
Fi,j = ue (23.8) 
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an equation which is covariant with respect to Lorentz trans- 


formations. 
Finally, consider equations (3.1) and (3.4). These can be written 


OF 34. OF 4, OF- 
w7 34 piace _* 23 = 0, 
Ox 0x3 Ox4 
OF,, OF 1, oF: 

41 ie 13 + pine 0 
0x3 0x4 Ox, 
OF {2 OF 24 | OF 41 

ponearenr + tees 
x4 Ox Ox2 
OF,3 OF, OF; 

23 Lae 12 - 0 

8x, Ox, 0x3 


(23.9) 
= 0, 


These equations are summarized thus: 
Fy nt Fjx,it Fri, = 0. (23.10) 


If any pair from i, j, kK are equal, since Fj; is skew-symmetric, the left- 
hand member of this equation is identically zero and the equation is 
trivial. The four possible cases when i, j, K are distinct are the equations 
(23.9). Equation (23.10) is a tensor equation and is therefore also 
covariant with respect to Lorentz transformations. 
To sum up, Maxwell’s equations in 4-dimensional covariant form 
are: 
Fiy,j = © 4, 
c (23.11) 


Fy, c+ Fixit Fri,j = 9. 


Given J; at all points in space-time, these equations determine the 
field tensor F;;. The solution can be found in terms of a vector poten- 
tial 2; which satisfies the following equations: 


Q:; ; = 0, 


an (23.12) 
Q; j; = = poe 
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Q; being determined, Fj; follows from the equation 


F; = 2; »— Qj. (23.13) 


24. Lorentz transformations of electric and magnetic intensities 
Since F, is a tensor, relative to the special Lorentz transformation 
(5.1) its non-zero components transform thus: 


F,3 = Fy3, 
F;, = F3,cosa+ F34sin «, (24.1) 
Fi. = Fy. cosa+ F42 sin a, 
Fig = Fra, 
Fo, = —Fysinat+ Fy4cosa, (24.2) 
Fy4 = —F3,sina+ F34cos a. 


Substituting for the components of F;; from equation (23.5) and for 
sin «, cos from equations (5.7), the above equations (24.1) yield the 
special Lorentz transformation equations for H, viz. 


Hy, + (u/c) E, 


_ H,—-leyE, 
(1—u?/c)"/?” 


acer ou 


Similarly, equations (24.2) yield the transformation equations for E, 
viz. 


A,=H,, A, = A, (24.3) 


S E,—(ule) H, B= E,+ (u/c) Hy | 
~ w/c??? (1 — u/c?) 


As an example of the use to which these transformation formulae 
may be put, consider the electromagnetic field of a point charge e 
moving uniformly with speed u relative to the observer. Let S be the 
inertial frame employed by the observer and suppose e moves along 
its x-axis. S will be a parallel inertial frame with the point charge 
stationary at its origin. Consider the instant t = 0 in S, when the point 
charge is at the origin of S and the origins of Sand S coincide. Foran 
observer employing 5, the electromagnetic field is that due to a 


E,=E,, E, (24.4) 
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stationary point charge and is accordingly specified at the point 
(x, ¥,Z) for all ~ by the equations 


= 5(%,5,2), A =0, (24.5) 
r 


where F” = £7+ 57+ 27. The field in S can now be calculated from the 
transformation equations inverse to (24.3), (24.4) (replace u by —u) 
and proves to be given by 


elec De aod 
~ BL wey?’ Gwe?) 


(24.6) 
He e 0 uz/c uy/c 
PL? d=)?!" dw /c*y'? 
But, putting t = 0 in equations (5.8), it will be found that 
g i jay, i= 24 
a pene 2 en (24.7) 
Whence, equations (24.6) are equivalent to 
e 
— P31 — 2/21? (x 9VoZ )s 
(24.8) 


eu/c 
H = FU w/e O —Z,y), 


2 
Xx 

where r? = —___ + y* 4 2?, 

(w/e) ” 


If r is the position vector of the point (x, y, z) with respect to the origin 
of S, equations (24.8) can be written 


e 
E = =a 
rd — u?/c?)'/? 


(24.9) 
1 e 


H = -~,—*, a jpuxr. 
er — wero" 
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These equations indicate that, at this instant in S, the E-lines are 
straight lines radiating from O and the H-lines are circles whose 
centres lie on Ox and whose planes are parallel to Oyz. 

If (u/c)? is considered to be negligible, equations (24.9) reduce to 


1 
E=5r, H=-5uxr. (24.10) 
r cr 


The first equation shows that the electric field is, to this order of 
approximation, identical with the field of a stationary charge and the 
second equation is the Biot-Savart Law. 


25. The Lorentz force 
We shall now calculate the force exerted upon a point charge e in 
motion in an electromagnetic field. 

At any instant, we can choose an inertial frame relative to which the 
point charge is instantaneously at rest. Let Ep be the electric intensity 
at the point charge relative to this frame. Then, by the physical 
definition of electric intensity as the force exerted upon unit stationary 
charge, the force exerted upon e will be cE. It follows from equation 
(18.2) that the 4-force acting upon the charge in this frame is given by 


F = (€Ep, 0). (25.1) 
The 4-velocity of the charge in this frame is also given by 
V = (0, ic) (25.2) 


and hence, by equation (23.5), 
“Fy V; = e(Exos Ey: Ex, 0) = (Eo, 0). (25.3) 


It has accordingly been shown that, in an inertial frame relative to 
which the charge is instantaneously stationary, 


Fe : Fy Vj. (25.4) 


But this is an equation between tensors and is therefore true for all 
inertial frames. 
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Substituting in equation (25.4) for the components F;, F,;, V; from 
equations (18.2), (23.5) and (15.6) respectively, the following equations 
are obtained: 


e 
Se = ~ (Hz 0y— Hyv, + cE,), 
é 
h= ~ (Hx 02— Hz 05+ cE,), (25.5) 


e 
= 7 Hy Ox — Hy by + CE;). 
These equations are equivalent to the 3-vector equation 
e 
f= eh +-“VXH. (25.6) 


f is called the Lorentz force acting upon the charged particle. 


26. Force density 

Consider a continuous distribution of matter in motion under the 
action of some field of force. Let dwo be the proper volume of any 
small element of the distribution and let F be the 4-force exerted upon 
the element by the field. Writing 


F = Ddw, (26.1) 


it follows that, since F is a vector and dw is an invariant, then D is also 
a vector in space-time. It is termed the 4-force density vector for the 
field. 
When measured from an inertial frame S, let dw be the volume of 
_ the element and let f be the 3-force exerted by the field upon it. We 
shall define the 3-force density vector d in S by the equation 


f = ddw. (26.2) 


dw is an invariant with respect to transformations between Cartesian 
frames stationary in S and hence d is a vector relative to such 
frames. 
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Substituting for F, f from equations (26.1), (26.2) respectively into 
equation (18.2), we obtain 


D day = (a,£a-v) dal —v*/c?)~"/?, (26.3) 


By virtue of the relationship (21.8), this reduces immediately to the 
equation 


D= (a,‘a-»), (26.4) 
c 


é 


relating the 3- and 4-force density vectors. 


27. The energy-momentum tensor for an electromagnetic field 
Suppose that a charge distribution is specified by a 4-current density 
vector J. If dwg is the proper volume of any small element of the distri- 
bution and pp is the proper density of the charge, the charge within 
the element will be podwo. It follows from equation (25.4) that the 
4-force exerted upon the element by the electromagnetic field is given 
by 

F; = Fi, Video, (27.1) 
V being the 4-velocity of flow for the element. Employing equation 
(21.7), this last equation can be written 


l 
F; = = Fudsduo (27.2) 


and it follows from the definition given in the last section that the 
4-force density for the electromagnetic field is given by 
1 
D; ad a Fas (27.3) 


Substituting for J; from the first of equations (23.11), we can express 
D, in terms of the field tensor thus: 


1 
D; = ane Fix, k- (27.4) 
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We will now prove that the right-hand member of this equation is, 
apart from sign, the divergence of a certain symmetric tensor Sy given 
by the equation 
1 1 
Sy = a, Fim Fin in 94j Fay Fra (27.5) 
and called the energy-momentum tensor of the electromagnetic field. 
Taking the divergence of S,, we have 


1 1 1 
Sy = —Fix,j Fin t+ Fiz Fjxj- ra 84 Fat Fuj. (27.6) 


4 Arr 
Now 
Fix g Fy = Fy Fg = Fyn Fins (27.7) 
since Fj; is skew-symmetric. Thus 
Fix, Fy = 8 iz t+ Fi) Fr. (27.8) 
Also 
8y Fa Fiay = Fa Fiat = — Fin Fig,i (27.9) 


and it follows from these results that the first and last terms of the 
right-hand member of equation (27.6) can be combined to yield 


1 
i (Fix, + Bint Fy, Fix (27.10) 


and this is zero by the second of equations (23.11). 
Hence 


. 1 1 
Sy,j = dart tk Fak J = — Gy ie Pg. = — D;. (27.11) 

Substituting for the components of the field tensor from equation 
(23.5), the components of Sj are calculable from equation (27.5) as 
follows: If i, j take any of the values 1, 2, 3, then writing E, for E,, 
E, for E,, etc. 


1 
Sy= —pGEt+HH), i# (27.12) 
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Ifi=j=1, 


1 1 
5), = ~ (H24+ H2-E2)—-—(H?- E>, 
11 a,° 2+ H3—£}) rm ) 


1 1 
= 4, Fit A+ (E? +H’). (27.13) 


Soo, S33 may be expressed similarly and therefore, in general, if 
i,j = 1, 2, 3, 


1 1 
Sy = -7 G+ Hi H)+ §(E?+H?). (27.14) 


Apart from sign, this is Maxwell’s stress tensor ty. ty is only a tensor 
with respect to rectangular frames stationary in the inertial frame 
being employed. 

Also, if i= 1, 2, 3, 


i 
Sig = Sqi = gy Co? Hs — Bs Ho, Bs Hi — By 3, E, H2— E, Aj) 


-—ExH=/“s, (27.15) 
rr c 


where S is Poynting’s vector. 
Finally, 
1 
Sua =e (E?+ H?) = —U, (27.16) 
TT 
where U is the energy density in the electromagnetic field. 


These results may be summarized conveniently by exhibiting the 
components of S; in a matrix thus: 


ti S/ic 
(Sj) = - (“-) : (27.17) 
S/ic} U 


Returning to the set of four equations (27.11), each can now be 
expressed in-classical three-dimensional form. Thus, if i= 1, 2, 3, by 
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reference to the equations (26.4) and (27.17) the corresponding equa- 
tion is seen to be equivalent to 


tyj- 3G, = &- (27.18) 
If i= 4, the corresponding equation is equivalent to 
. aU 
divS+— = —d-y, (27.19) 


Equations (27.18), (27.19) can be given simple physical interpreta- 
tions in the case when the motion of a cloud of charged particles which 
do not interact mechanically is being studied. Each particle is then 
subjected to electromagnetic forces only and it will be assumed that 
its rest mass is constant throughout the motion, i.e. no heat is gener- 
ated. Let ' be a closed surface stationary in the frame S and let I be 
the region of space it encloses. Integrating equation (27.19) over I" 
and employing Green’s theorem, it will be found that 


d 
| S,do = { d-vdeo+ = { Udw, (27.20) 
Pa r r 


where do is a surface element of 2, dw is a volume element of I’ and 
S,, is the component of S along the inwards normal to XY’. Now d-vdw 
is the rate at which the force applied to the charge in dw is doing work 
and is therefore equal, by equation (17.7), to the rate of increase of the 
mechanical energy of this charge. The first term of the right-hand 
member of equation (27.20) accordingly gives the total rate at which 
the mechanical energy of the charge which is inside J at the instant 
under consideration is increasing. Since 2 is a fixed surface and the 
charge is moving, some of this mechanical energy will be lost by 
transport of charge across 2. We have, therefore, 


f d-vdw = rate of increase of mechanical energy inside 2 
r 


+rate of loss of mechanical energy across 2. (27.21) 


The second term of the right-hand member of equation (27.20) 
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measures the rate of increase of electromagnetic field energy inside 2. 
Equation (27.20) accordingly asserts that 


| S,,do+ rate of gain of mechanical energy across 2 
>> 


= total rate of energy increase within 2. (27.22) 


For the law of conservation of energy to be valid, it is clearly necessary 
to interpret the inward flux of S across 2 as the rate of flow of electro- 
magnetic energy across this surface. Thus S is the energy current 
density vector. 

Taking 2 to be a surface whose elements are all at a great distance 
from the charge flow being considered, so that E = 0, H = 0 over 2 
and integrating the i‘" equation (27.18) over I’, the contribution of the 
term ty, ; will be zero. This follows since ty, ; is the ordinary divergence 
of a vector having components (t;1, t;2, t;3) and its volume integral, by 
Green’s theorem, can be expressed as a surface integral of this vector 
over / and the vector is everywhere zero on &. There results, therefore, 


dfl 

i did +— | 2 Sides = 0. (27.23) 
r r 

Now d,dw is the i" component of the force exerted upon the charge in 
dw and therefore gives the rate at which its momentum is increasing. 
It follows from equation (27.23) that the net i th component of momen- 
tum of an isolated system of charges is conserved only if the electro- 
magnetic field is supposed to contribute momentum whose density is 
S/c*. This electromagnetic momentum density vector will be denoted 
by g and thus 


1 
= -.S, 27. 
Z 38 (27.24) 
If w is the velocity of propagation of electromagnetic energy, then 
S = Uw (27.25) 
and hence, by equation (27.24), 
U 
Zg= a w. (27.26) 


But, according to equation (17.14), U/c* is the mass density associated 
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with an energy density U. It is consistent with earlier theory therefore, 
that such a mass density flowing with velocity w should generate a 
momentum density g. 


28. Equations of motion of a charge flow 
In this section, we shall continue to restrict our attention to a system 
comprising a cloud of charged particles which do not interact mechan- 
ically and whose proper masses remain constant during their motions. 
Since the proper masses of the particles are conserved during the 
motion, an equation.of continuity for proper mass can be found. Let 
be any closed surface bounding a region I’. Then the rate at which 
the net proper mass in I" is decreasing must equal the rate at which 
proper mass is being lost by outwards flow across 2. Let dw be the 
volume of an element of the charge distribution as measured in the 
inertial frame S being employed and let podw be the proper mass of 
the element. xo is the density of proper mass in S. odw is an invariant, 
but dw is not and hence neither is jzp. Then the mathematical expres- 
sion of the statement we have just made is 


“ai podw = J reve (28.1) 


where v,, is the component of the velocity of flow v along the outward 
normal to 2. Employing Green’s theorem, equation (28.1) can be 
written 


a 
| ("ee +div wow) dw = 0, (28.2) 
: 
and, since I is arbitrary, this implies the equation of continuity 
7) 
a +div(u9¥) = (28.3) 


Let dwg be the proper volume of a charge element and let 99 dw be 
the proper mass of this element. Then jo9 is called the proper density 
of proper mass. Since ppg dwo and dup are invariants, So is Zoo. Clearly 


Hop dwy = podw (28.4) 
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and hence, by equation (21.8), 


Poo = (L—v?/c?)"? pup. (28.5) 


It now follows that equation (28.3) can be written in the covariant 
form 


7) 
Fa, too Vi) = 0, (28.6) 


where V; is the 4-velocity of flow. 

The 4-force exerted by the field upon the charge element of proper 
volume dwg is, by equation (26.1), D;dwo. Since prgq dw is the proper 
mass of this element, its equation of motion (see equation (17.2)) is 
therefore 

D; = boo (28.7) 
TT 
But V; can be expressed as a function of the x; and, as the charge 
element moves, its coordinates x; vary as functions of its proper time 
7. It follows that 


a= it Ip, (28.8) 


Hence 


OV, _. a 
— (1199 VV) = poo Vj — + Vi— Goo V), 
(H00 Vi Vj) = Hoo lex, tj, Woo -) 


= Di, (28.9) 


equations (28.6), (28.8) and (28.7) having been employed to effect the 
reduction. 
We now define a symmetric tensor 9, by the equation 


Oy = po ViV;, (28.10) 
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so that D; can be written as the divergence of this tensor thus 


D; = @;; ;. (28.11) 
Consider the components of 0,;. We have 
2 2 
Soin rN See Oe et te 
Ou = 1 —wv?/c? (1 —v?/c?)!/2 is; A281) 


where yp is the mass density in S. Hence, apart from sign, @4, is the 
mechanical energy density W. 


If i= 1; 2, 3, then 


iCjL99 Vi : : 
04 = ee 72 = icuv; = icg,, (28.13) 


where g = pv is the momentum density. 
Finally, if i, j = 1, 2, 3, 


U;V; 
0, oy noe 5 = PU; V; = 8; Vj. (28.14) 


Now g;,V is the current density of the i** component of the momentum 
and thus the i" row of the 3 x 3 matrix (9,,) can be so interpreted. 
To summarize, we have shown that 


8iV; icg 


(0) = (28.15) 
icg|—W 
This representation should be compared with the representation 
(27.17) of the electromagnetic energy-momentum tensor S;;. Since, by 
equation (27.24), S = c’g, it is clear that @,, is the counterpart for the 
mass distribution of S,; for the electromagnetic field. 0; is called the 
kinetic energy-momentum tensor. 
Substituting in the equation of motion (28.11) for D; from equation 
(27.11), we find that 
— Si ji = Oyj. (28.16) 
Writing 
we have finally 
Ty; = 0. (28.18) 
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7; is the overall energy-momentum tensor including contributions 
from an energy distribution in an electromagnetic form and from an 
energy distribution in a material form. Equation (28.18) indicates 
that the flow is determined by the statement that the divergence of the 
net energy-momentum tensor vanishes. This result can be proved to 
be true quite generally, i.e. corresponding to any energy (matter) 
distribution, there is an energy-momentum tensor 7;; whose diver- 
gence vanishes. It will be shown later in section 48 that this tensor also 
determines the gravitational field of the distribution. 

As shown in the previous section, equation (28.18) with i= 1, 2, 3, 
expresses that the net linear momentum of the system is conserved 
and, with i = 4, that the total energy is conserved. 


Exercises 4 
1. Write down the special Lorentz transformation equations for J 
and deduce the transformation equations for j, p, viz. 
Je = (1 —u?/c*)—7(j,,— pu), hy = jy, 
Pp = (1 —u?}c?)~"!*(p—j, uc), iP = jz. 
2. Deduce from the Maxwell equation 


4 
Fiji = Pa 


that divJ = 0. 

3. Verify that the field tensor defined in terms of the 4-potential 
92; by equation (23.13) satisfies Maxwell’s equations (23.11) provided 
§2; satisfies the equations (23.12). 

4. (i) Prove that 

Fj Fy = 2(H?—E”) 


and deduce that H? — E? is invariant with respect to Lorentz trans- 
formations. 
(ii) Prove that 
Ciki F, Fi = 8iE-H 


and deduce that E-H is an invariant density with respect to Lorentz 
transformations. 
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5. Aninfinite uniform line charge lies along the x-axis of the inertial 
frame S and has longitudinal velocity u. As measured in S, the charge 
per unit length is e. A point P is at distance r from the x-axis and a is 
the unit vector along the perpendicular to this axis through P. Show 
that the electric and magnetic field intensities at P are given by 


6. An observer O at rest in an inertial frame Oxyzt finds himself to 
be in an electric field E = (0, E,0), with no magnetic field. Show that 
an observer O’ moving according to O with uniform velocity V at 
right angles to E, finds electric and magnetic fields E’, H’ connected 
by the relation 

cH’+VXE’ = 0. 
(L.U.) 


7, Oxyz are rectangular axes. An electron moves in the xy-plane 
under the action of a uniform magnetic field parallel to Oz. Prove that 
its path is a circle. 

8. A plane monochromatic electromagnetic wave is being propa- 
gated in a direction parallel to the x-axis in the inertial frame S. Its 
electric and magnetic field components are given by 


E = (0, asin w(t—<x/c), 0], 
H = (0,0, asin w(t—x/c)]. 


Show that, when observed from the inertial frame 5, it appears as the 
plane monochromatic wave 
E = (0, Aasin Aw(f —Z/c), 0], 


Ai = (0,0, Aasin Aw(F — X/c)], 


oy 1—u/c 
where A= (=): 


u being the velocity of § relative to S. (I.e. both the amplitude and 
frequency are reduced by a factor A. The reduction in frequency is the 
Doppler effect.) 
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9. Show that the Hamiltonian for the motion of a particle with 
charge e and mass m in an electromagnetic field (A, ¢) is 


oN 1/2 
H= e|(p-£4) tne +e¢ 


_ and express this equation in the covariant form 
P-& = —m?2c?, 


10. Verify that, in a region devoid of charge, equations (23.12) are 
satisfied by 
Q i = A; elke 7*. 


provided A,;, k, are constants such that 
A,k; = 0, kk, = 0. 


By considering the 4-vector property of 2;, deduce that A; must 
transform as a 4-vector under Lorentz transformations. Deduce also 
that k,x, is a scalar under such transformations and hence that k, is 
a 4-vector. 

A plane electromagnetic wave, whose direction of propagation is 
parallel to the plane Oxy and makes an angle « with Ox, is given by 


Q; — A; e2tiv(x cosa+ysin acne 


where v is the frequency. The same wave observed from a parallel 
frame OxjZ moving with velocity u along Ox, has frequency * and 
direction of propagation making an angle « with Ox. By writing 


down the transformation equations for the vector k,, prove that 


u Uu 
1—- cosa cos a—— 
iS c _ c 
j=», COSa = 
Uu 
(1 —v2/c2)1/2 1—-cosa 
c 


11. A particle of mass m and charge e moves freely in a magnetic 
field with components (0,0, H/z) and there is no electric field. Show 
that mm is constant during the motion and, by a suitable choice of the 
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initial conditions, prove that the motion of the particle is given by 
x = atsin(Alog)), 
y = atcos(Alogd), 
z= kt, 


where A= eH /mck and hence that the particle moves on the surface of 
the cone 
k2(x2+ y?) = a2z?. 


CHAPTER 5 
General Tensor Calculus. Riemannian Space 


29. Generalized N-dimensional spaces 

In Chapter 2 the theory of tensors was developed in an N-dimensional 
Euclidean space on the understanding that the coordinate frame being 
employed was always rectangular Cartesian. If x;, x;+ dx; are the co- 
ordinates of two neighbouring points relative to such a frame, the 
‘distance’ ds between them is given by the equation 


ds* = dx; dax;. (29.1) 


If X;, X; + dX; ace the coordinates of the same points with respect to 
another rectangular Cartesian frame, then 


ds* = dk;di; (29.2) 


and it follows that the expression dx;dx; is invariant with respect to a 
transformation of coordinates from one rectangular Cartesian frame 
to another. Such a transformation was termed orthogonal. 

Now, even in &3, it is very often convenient to employ a coordinate 
frame which is not Cartesian. For example, spherical polar coordin- 
ates (r,0,¢) are frequently introduced, these being related to rect- 
angular Cartesian coordinates (x,y,z) by the equations 


x=rsin@cos¢, y=rsin@sin¢g, z=rcos@. (29.3) 
In such coordinates, the expression for ds” will be found to be 
ds? = dx? +dy’+dz’, 
= dr? +r d6? +r sin? 0.d¢?, (29.4) 


and this is no longer of the simple form of equation (29.1). The co- 

ordinate transformation (29.3) is accordingly not orthogonal. In fact, 

it is not even linear, as was the most general coordinate transforma- 
tion (8.1) considered in Chapter 2. 

The spherical polar coordinate system is an example of a curvilinear 
81 


82 TENSOR CALCULUS AND RELATIVITY 


coordinate frame in &3. Let (u,v, w) be quantities related to rectangular 
Cartesian coordinates (x,y,z) by equations 


u = u(x,y,z), v= v(x,y,z), w= w(x,y,Z), (29.5) 


such that, to each point there corresponds a unique triad of values of 
(u,v, w) and to each such triad there corresponds a unique point. Then 
a set of values of (u,v, w) will serve to identify a point in &, and (u,v, w) 
can be employed as coordinates. Such generalized coordinates are 
called curvilinear coordinates. 
The equation 
u(x, y,Z) = Uo, (29.6) 


where up is some constant, defines a surface in &; over which u takes 
the constant value up. Similarly, the equations 


v= Ug, W= Wo (29.7) 


define a pair of surfaces on which v takes the value vp and w the value 
Wo respectively. These three surfaces will all pass through the point Po 
having coordinates (up, vp, Wo) aS Shown in Fig. 4. They are called the 
coordinate surfaces through Po. The surfaces v = vo, w = Wo will inter- 
sect in a curve Pp U along which v and w will be constant in value and 
only u will vary. Po U is a coordinate line through Po. Altogether, three 
coordinate lines pass through Pp. The equations uv = constant, v = con- 
stant, w = constant define three families of coordinate surfaces corre- 
sponding to the three families of planes parallel to the coordinate 
planes x = 0, y= 0, z= 0 of a rectangular Cartesian frame. Pairs of 
these surfaces intersect in coordinate lines which correspond to the 
parallels to the coordinate axes in a Cartesian frame. 

Solving equations (29.5) for (x,y,z) in terms of (u,v, w), we obtain 
the inverse transformation 

x = x(u,v,w), y= ylu,v,w), z= zu,v,w). (29.8) 

Let (x,y,z), (x+dx,y+dy,z+dz) be the rectangular Cartesian co- 
ordinates of two neighbouring points and let (u,v, w), (u+du,v+ dv, 
w-+dw) be their respective curvilinear coordinates. Differentiating 
equations (29.8), we obtain 
ox 


ox 
re dut+ ow dw, etc. (29.9) 


ex 
= — di 
Ou ut 
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Thus, if ds is the distance between these points, 


ds* = dx? + dy* +dz?, 
= Adu*+ Bd? + Cdw* +2F du dw+2G dwdu+2Hdudv, (29.10) 


giving the appropriate expression for ds? in curvilinear coordinates. It 
will be noted that the coefficients A, B, etc. are, in general, functions 
of (u,v, w). ; 


Fic. 4 


If, therefore, curvilinear coordinate frames are to be permitted, the 
theory of tensors developed in Chapter 2 must be modified to make it 
independent of the special orthogonal transformations for which ds* 
is always expressible in the simple form of equation (29.1). The 
necessary modifications will be described in the later sections of this 
chapter. However, these modifications prove to be of such a nature 
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that the amended theory makes no appeal to the special metrical 
properties of Euclidean space, i.e. the theory proves to be applicable 
in more general spaces for which Euclidean space is a particular case. 
This we shall now explain further. 

Let (x!,x*,...,x%) be curvilinear coordinates in &y.t Then, by 
analogy with equation (29.10), if ds is the distance between two 
neighbouring points, it can be shown that 


ds* = gy dx! dx’, (29.11) 


where the coefficients g;; of the quadratic form in the x! will, in general, 
be functions of these coordinates. Since the space is Euclidean, it is 
possible to transform from the curvilinear coordinates x! to Cartesian 
coordinates y' so that 


ds* = dy' dy’. (29.12) 


Clearly, the reduction of ds* to this simple form is only possible 
because the functions g,, satisfy certain conditions. Conversely, the 
satisfaction of these conditions by the g;; will guarantee that coordin- 
ates y! exist for which ds” takes the simple form (29.12) and hence that 
the space is Euclidean. However, in extending the theory of tensors to 
be applicable to curvilinear coordinate frames, we shall, at a certain 
stage, make use of the fact that ds* is expressible in the form (29.11), 
but no use will be made of the conditions satisfied by the coefficients 
gi; which are a consequence of the space being Euclidean. It follows 
that the extended theory will be applicable in a hypothetical N-dimen- 
sional space for which the ‘distance’ ds between neighbouring points 
x!, x! + dx! is given by an equation (29.11) in which the g;; are arbitrary 
functions of the x'.* Such a space is said to be Riemannian and will be 
denoted by #y. @y isa particular #y for which the g,; satisfy certain 
conditions. The right-hand member of equation (29.11) is termed the 
metric of the Riemannian space. 

The surface of the Earth provides an example of an #,. If 6 is the 


+ The coordinates are here distinguished by superscripts instead of sub- 
scripts for a reason which will be given later. 

* Except that partial derivatives of the g;; will be assumed to exist and to 
be continuous to any order required by the theory. 
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co-latitude and ¢ is the longitude of any point on the Earth’s surface, 
the distance ds between the points (6,4), (0+ d0,¢+d¢) is given by 


ds? = R*(d6" + sin’ 6 dd), (29.13) 


where Ris the earth’s radius. For this space and coordinate frame, the 
gi take the form 


Gi = R, 812 = 821 = 0, B22 = R’sin?@. — (29.14) 
It is not possible to define other coordinates (x,y) in terms of which 


ds* = dx*+dy* (29.15) 


over the whole surface, i.e. this #, is not Euclidean. However, the 
surfaces of a right circular cylinder and cone are Euclidean; the proof 
is left as an exercise for the reader. 

It will be proved in Chapter 6 that, in the presence of a gravitational 
field, space-time ceases to be Euclidean in Minkowski’s sense and 
becomes an &,. This is our chief reason for considering such spaces. 
However, we can generalize the concept of the space in which our 
tensors are to be defined yet further. Until section 39 is reached, we 
shall make no further reference to the metric. This implies that the 
theory of tensors, as developed thus far, is applicable in a very general 
N-dimensional space in which it is assumed it is possible to set up a 
coordinate frame but which is not assumed to possess a metric. In such 
a hypothetical space, the distance between two points is not even 
defined. It will be referred toas Sy. Ay isa particular Sy for which 
a metric is specified. 


30. Contravariant and covariant tensors 

Let x! be the coordinates of a point P in Wy relative to a coordinate 
frame which is specified in some manner which does not concern us 
here. Let x’ be the coordinates of the same point with respect to 
another reference frame and let these two systems of coordinates be 
related by equations 


x! = H(x1,x7,...,x). (30.1) 


Consider the neighbouring point P’ having coordinates x! + dx! in the 
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first frame. Its coordinates in the second frame will be x‘ + dz', where 


dx! = — dx’, (30.2) 


and summation with respect to the index j is understood. The N 
quantities dx’ are taken to be the components of the displacement 
vector PP’ referred to the first frame. The components of this vector 
referred to the second frame are, correspondingly, the dx! and these 
are related to the components in the first frame by the transformation 
equation (30.2). Such a displacement vector is taken to be the proto- 
type for all contravariant vectors. 

Thus, 4’ are said to be the components of a contravariant vector 
located at the point x’, if the components of the vector in the ‘ barred’ 
frame are given by the equation 


ax! 


j 
=I’. (30.3) 


Ai= 
It is important to observe that, whereas in Chapter 2 the coefficients 
a,j occurring in the transformation equation (10.2) were not functions 
of the Cartesian coordinates x; so that the vector A was not, neces- 
sarily, located at a definite point in &y, the coefficients 2x'/2x/ in the 
corresponding equation (30.3) are functions of the x! and the precise 
location of the vector A’ must be known before its transformation 
equations are determinate. This can be expressed by saying that there 
are no free vectors in Sy. 

The form of the transformation equation (30.3) should be studied 
carefully. It will be observed that the dummy index j occurs once as a 
superscript and once as a subscript (i.e. in the denominator of the 
partial derivative). Dummy indices will invariably occupy such 
positions in all expressions with which we shall be concerned in the 
sequel. Again, the free index i occurs as a superscript on both sides of 
the equation. This rule will be followed in all later developments, i.e. 
a free index will always occur in the same position (upper or lower) in 
each term of an equation. Finally, it will assist the reader to memorize 
this transformation if he notes that the free index is associated with 
the ‘barred’ symbol on both sides of the equation. 
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A contravariant vector A’ may be defined at one point of /y only. 
However, if it is defined at every point of a certain region, so that the 
A’ are functions of the x’, a contravariant vector field is said to exist in 
the region. 

If V is a quantity which is unaltered in value when the reference 
frame is changed, it is said to be a scalar or an invariant in Sy. Its 
transformation equation is simply 


V= Vz (30.4) 


Since this equation involves no coefficients dependent upon the x’, 
the possibility that Vmay bea free invariant exists. However, Vis more 
often associated with a specific point in Aj and may be defined at all 
points of a region of y, in which case it defines an invariant field. In 
the latter case 

V = V(x!,x7,...,x%). (30.5) 


P will then, in general, bea quite distinct function of the x‘. If, however, 

in this function we substitute for the <’ in terms of the x! from equation 
(30.1), by equation (30.4) the right-hand member of equation (30.5) 
must result. Thus 


Vx! x’,...,2%) = Vix!) x*,...,x7). (30.6) 


V being an invariant field, consider the N derivatives 2V/ax!. In the 
x'-frame, the corresponding quantities are 2V//ax' and we have 
tare Z 
a ee (30.7) 
ox’ —s Ax? ax’ ox! Ox! 
since, by equation (30.6), when V is expressed as a function of the x 
it reduces to V. As in Chapter 2, the 0V/ax! are taken to be the com- 
ponents of a vector called the gradient of V and denoted by grad V. 
However, its transformation law (30.7) is not the same as that for a 
contravariant vector, viz. (30.3) and it is taken to be the prototype for 
another species of vectors called covariant vectors. 
Thus, B; is a covariant vector if 


ax4 
ax! 


B, = —.B,. (30.8) 
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Covariant vectors will be distinguished from contravariant vectors by 
writing their components with subscripts instead of superscripts. This 
notation is appropriate, for 2V/dx' is a covariant vector and the index i 
occurs in the denominator of this partial derivative. The vector dx’, 
on the other hand, has been shown to be contravariant in its transfor- 
mation properties and this is correctly indicated by the upper position 
of the index. This is the reason for denoting the coordinates by x! 
instead of x;, although it must be clearly understood that the x! alone 
are not the components of a vector at all. 

The reader should check that the three rules formulated above in 
relation to the transformation equation (30.3), apply equally to the 
equation (30.8). 

The generalization from vectors to tensors now proceeds along the 
same lines as in section 10. Thus, if A‘, B’ are two contravariant 
vectors, the N? quantities A'B/ are taken as the components of a 
contravariant tensor of the second rank. Its transformation equation 
is found to be 


=i osj 
A Bi = at aut B’. (30.9) 


Any set of N? quantities C! transforming in this way is a contravariant 
tensor. 

Again, if A’, B; are vectors, the first contravariant and the second 
covariant, then the N” quantities A! B, transform thus: 


I 
ak agit Br (30.10) 


Any set of N? quantities C! transforming in this fashion is a mixed 
tensor, i.e. it possesses both contravariant and covariant properties 
as is indicated by the two positions of its indices. 

Similarly, the transformation law for a covariant tensor of rank 2 
can be assembled from the law for covariant vectors. 

The further generalization to tensors of higher rank should now be 
an obvious step. It will be sufficient to give one example. Ai, is a 
mixed tensor of rank 3, having both the covariant and contravariant 
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properties indicated by the positions of its indices, if it transforms 
according to the equation 


ax! ax ax! 


PF | Panicle cede 
Jk ax? axl ax* 


(30.11) 


The components of a tensor can be given arbitrary values in any 
one frame and their values in any other frame are then uniquely de- 
termined by the transformation law. Consider the mixed second rank 
tensor whose components in the x'-frame are 5) , the Kronecker deltas 
(5) = = 0,i+jand 5} = = 1,i=/). The somporients in the <'-frame are 5, 
where 


=i I 
gi. OF OX oy 
4 axk agi !? 


ax! axk 
ax ax” 


ax! 


= §. (30.12) 


Thus this tensor has the same components in all frames and is called 
the fundamental mixed tensor. However, a second rank covariant ten- 
sor whose components in the x'-frame are the Kronecker deltas (in 
this case denoted by 6,,), has different components in other frames and 
is accordingly of no special interest. 

It is reasonable to enquire at this stage why the distinction between 
covariant and contravariant tensors did not arise when the coordinate 
transformations were restricted to be orthogonal. Thus, suppose that 
A’, B; are contravariant and covariant vectors with respect to the 
orthogonal transformation (8.1). The inverse transformation has been 
shown to be equation (11.5) and it follows from these two equations 
that 


i 
— = —_— = oe 5 30. 
aij, ax; Qi ( 1 3) 
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For the particular case of orthogonal transformations, therefore, 
equations (30.3), (30.8) take the form 


A! = ay A, B; = ay B;. (30.14) 


It is clear that both types of vector transform in an identical manner 
and the distinction between them cannot, therefore, be maintained. 

As in the case of the Cartesian tensors of Chapter 2, new tensors 
may be formed from known tensors by addition (or subtraction) and 
multiplication. Only tensors of the same rank and type may be added 
to yield new tensors. Thus, if Aj,, Bj, are components of tensors and 
we define the quantities C/;, by the equation 


Ci, = Ait Bis (30.15) 


then Che are the components of a tensor having the covariant and 
contravariant properties indicated by the position of its indices. How- 
ever, Ai, B, cannot be added in this way to yield a tensor. Any two 
tensors may be multiplied to yield a new tensor. Thus, if A! i> Bi, are 
tensors and we define N° quantities Cr by the equation 


Cir, = Aj Bs (30.16) 


these are the components of a fifth rank tensor having the covariant 
and contravariant properties indicated by its indices. The proofs of 
these statements are left for the reader to provide. 

If a tensor is symmetric (or skew-symmetric) with respect to two 
of its superscripts or to two of its subscripts in any one frame, then it 
possesses this property in every frame. The method of proofis identical 
with that of the corresponding statement for Cartesian tensors given 
in section 10. However, if Aj = A/is true for all i, j when one reference 
frame is being employed, this equation will not, in general, be valid 
in any other frame. Thus, symmetry (or skew-symmetry) of a tensor 
with respect to a superscript and a subscript is not, in general, a 
covariant property. The tensor 5} is exceptional in this respect. 

Another result of great importance which may be established by the 
same argument we employed in the particular case of Cartesian ten- 
sors, is that an equation between tensors of the same type and rank is 
valid in all frames if it is valid in one. This implies that such tensor 
equations are covariant (i.e. are of invariable form) with respect to 
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transformations between reference frames. The usefulness of tensors 
for our later work will be found to depend chiefly upon this property. 

A symbol such as A}, can be contracted by setting a superscript and 
a subscript to be the same letter. Thus Ai, Ai, are the possible con- 
tractions of Ai, and each, by the repeated index summation conven- 
tion, represents a sum. Since in the symbol Ai,, j alone is a free index, 
this entity has only N components. Similarly Aj, has N components. 
It will now be proved that, if Ai i is a tensor, its contactors are also 
tensors. Specifically, we shall prove that B; = Aj; is a covariant vector. 
For 


ax! ax’ axt 
ax’ ax! ax! sts 


_ (22! 32) ax" 4 
~ Vas! ax’) axis” 


B; = Ai; = 


= og B;, (30.17) 
establishing the result. This argument can obviously be generalized 
to yield the result that any contracted tensor is itself a tensor of rank 
two less than the tensor from which it has been derived and of the type 
indicated by the positions of its remaining free indices. In this con- 
nection it should be noted that, if Aj, is a tensor, Aj, is not, in general, 
a tensor; it is essential that the contraction be with respect to a super- 
script and a subscript and not with respect to two indices of the same 
kind. 

If Aj,, BY are tensors, the tensor 4}, B’ is called the outer product 
of these two tensors. If this product is now contracted with respect to 
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a superscript of one factor and a subscript of the other, e.g. Ai, Bj, 
the result is a tensor called an inner product. 


31. The quotient theorem. Conjugate tensors 
It has been remarked in the previous section that both the outer and 
inner products of two tensors are themselves tensors. Suppose, how- 
ever, that it is known that a product of two factors is a tensor and that 
one of the factors is a tensor, can it be concluded that the other factor 
is also a tensor? We shall prove the following quotient theorem: If 
the result of taking the product (outer or inner) of a given set of elements 
with a tensor of any specified type and arbitrary components is known 
to be a tensor, then the given elements are the components of a tensor. 
It will be sufficient to prove the theorem true for a particular case, 
since the argument will easily be seen to be of general application. 
Thus, suppose the A Nk are N°? quantities and it is to be established that 
these are the components of a tensor of the type indicated by the 
positions of the indices. Let B, be a mixed tensor of rank 2 whose 
components can be chosen arbitrarily (in any one frame only of course) 
and suppose it is given that the inner product 


Ai, BE = Cjs (31.1) 


is a tensor for all such B{. All components have been assumed calcu- 
lated with respect to the x-frame. Transforming to the x-frame, the 
inner product is given to transform as a tensor and hence we have 


Aik BE = Cis, (31.2) 


where Ai* are the actual components replacing the Aj, when the 
reference frame is changed. Let Aj, be a set of elements defined in the 
x-frame by equation (30.11). Since this is a tensor transformation 
equation, we know that the elements so defined will satisfy 


Ai, BE = Ci,. (31.3) 
Subtracting equation (31.3) from (31.2), we obtain 
(Ait — 4i,) BY = 0. (31.4) 


Since B¢ has arbitrary components in the x-frame, its components in 
the x-frame are also arbitrary and the components BE can assume any 
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convenient values. Thus, taking B* =1 when k= K and B¥ =0 
otherwise, equation (31.4) yields 


or AiR = Aig. (31.5) 
This being true for K = 1, 2, ..., N, we have quite generally 


ik = Aly. (31.6) 
This implies that Aj, does transform as a tensor. 

We will first give a very simple example of the application of this 
theorem. Let A! be an arbitrary contravariant vector. Then 


5} A/ = A! (31.7) 


and since the right-hand member of this equation is certainly a vector, 
by the quotient theorem 5 is a tensor (as we have proved earlier). 

As a second example, let g;; be a symmetric covariant tensor and 
let g = |g;;| be the determinant whose elements are the tensor’s com- 
ponents. We shall denote by G” the co-factor in this determinant of 
the element g,. Then, although G’ is not a tensor, if g 4 0, G4/g = g!/ 
is a symmetric contravariant tensor. To prove this, we first observe 
that 


gyG = gf, 2,G* = gd, (31.8) 
and hence, dividing by g, 
ge = OF, gygi* = 8. (31.9) 


- Now let 4! be an arbitrary contravariant vector and define the co- 
variant vector B; by the equation 


B; = g;,A*. (31.10) 


Since g 4 0, when the components of B; are chosen arbitrarily, the 
corresponding components of A’ can always be calculated from this 
last equation, i.e. B; is arbitrary with A’. But 


g! B; = g 9; AX = 84, AK = A’, (31.11) 
having employed the second identity (31.9). It now follows by the 
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quotient theorem that g” is a contravariant tensor. That it is sym- 
metric follows from the circumstance that G! possesses this property. 
gy, 84 are said to be conjugate to one another. 


32. Relative tensors and tensor densities 

The coordinates <' being expressed in terms of the coordinates x! by 
the equations (30.1), we shall define the determinant D of the trans- 
formation to have 2x!/az/ for its ij" element. Thus 


ax! 
ax/ 
Then 2% is a relative tensor of weight W (W a positive or negative 


integer) having both covariant and contravariant characteristics as 
are indicated by its indices, if its components transform thus: 


D= (32.1) 


G = D* ——_M- (32.2) 
This statement can be generalized in the obvious way to relative 
tensors of any rank. 

If W=0, a relative tensor reduces to an ordinary tensor. In the 
particular case W = 1, the relative tensor is referred to as a tensor 
density (cf. section 13). 

The following statements follow immediately from the definition of 
a relative tensor and the reader is left to devise formal proofs: (i) Rela- 
tive tensors of the same type and weight can be added or subtracted to 
yield new tensors of the same type and weight; (ii) two relative tensors 
can be combined into an outer product and this will be a relative 
tensor whose weight is the sum of the weights of its factors; (iii) if a 
relative tensor is contracted with respect to a contravariant and a co- 
variant index, its weight is unaltered but its rank is reduced by two. 

e/---" denotes a contravariant tensor density of the N‘® rank which 
is skew-symmetric with respect to every pair of indices. It then follows, 
as in section 13, that its components are determined as follows: 


t 


e/---" = 0, if two indices are the same, 


+e!2---% if; 7, ...,2isaneven permutation of 1, 2,..., N, 
= —el?---™ if; j,...,nis an odd permutation of 1, 2,..., N. 
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In particular, in the x-frame, we take e!?---" = 1. Then, in the X-frame, 


=1 2 =N 
a 
ax! Axi = ax" ‘ 
= DE, (32.3) 


where E is the determinant whose ij" element is 2x'/@x’, i.e. 


ax! 


E = |— 
ax/ 


: (32.4) 


Employing the usual rule for the multiplication of determinants, from 
equations (32.1), (32.4) it follows that 


ax! ax") — | ax! ; 
DE = | 5,4) = |agi| = |S = 1 (32.5) 
Hence 
ee ai, (32.6) 


proving that the components in the x-frame are identical with those 
in the x-frame. The density e’---” accordingly possesses components 
+1, —1 and 0 in all frames. 

It is left as an exercise for the reader to prove in a similar way that 
the skew-symmetric covariant relative tensor e,_._, of the N 'h rank 
and weight — 1, whose component e;2.__ v is unity in the x-frame, has 
the same components + 1, — 1 and Oinall frames. 

If A,isa covariant tenso.,, the determinant | A;;| provides an example 
of a relative invariant. For its transformation law is 


Ox” —s & 


axs | 


> 


|A;,| = 


e ox" 
ax? 


= D’|A,,|. (32.7) 


\Arsl | a3 


Its weight is seen to be 2. 
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Similarly, if A” is a contravariant tensor, the determinant [A will 
be found to transform according to the law 


|44| = E?|A¥| = D~?|A4| (32.8) 


and is therefore a relative invariant of weight — 2. 
Lastly, if A} is a mixed tensor, the determinant | A}| is an invariant, 
for its transformation law is: 


|4j| = DE|Aj| = |Ajl. (32.9) 


33. Covariant derivatives. Parallel displacement. Affine connection 
In the earlier sections of this chapter, the algebra of tensors was estab- 
lished and it is now time to explain how the concepts of analysis can 
be introduced into the theory. Our space y has N dimensions, but 
is otherwise almost devoid of special characteristics. Nonetheless, it 
has so far been able to provide all the facilities required of a stage upon 
which the tensors are to play their roles. It will now be demonstrated, 
however, that additional features must be built into the structure of 
S n, before it can function as a suitable environment for the operations 
of tensor analysis. 

It has been proved that, if ¢ is an invariant field, 24/@x' is a covariant 
vector. But, if a covariant vector is differentiated, the result is not a 
tensor. For, let A; be such a vector, so that 


xk 
A; = — Ak. (33.1) 
ox' 
Differentiating both sides of this equation with respect to x/, we 
obtain 
0A; ax* Ox! Ay # xk P (33.2) 
oxi ax! ax! ax!” axl agi * 
The presence of the second term of the right-hand member of this 
equation reveals that 2A,/2x’ does not transform as a tensor. However, 
this fact can be arrived at in a more revealing manner as follows: 
Let P, P’ be the neighbouring points x’, x'+ dx! and let A;, A;+dA; 
be the vectors of a covariant vector field associated with these points 
respectively. The transformation laws for these two vectors will be 
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different, since the coefficients of a tensor transformation law vary 
from point to point in / y. It follows that the difference of these two 
vectors, namely dA;, is not a vector. However, 


aA; 


Ay = 


dx! (33.3) 


and, since dx/ is a vector, if A; ; were a tensor, dA; would be a vector. 
A; ; cannot be a tensor, therefore. The source of the difficulty is now 
apparent. To define A, ,, it is necessary to compare the values assumed 
by the vector field A, at two neighbouring, but distinct, points and 
such a comparison cannot lead to a tensor. If, however, this procedure 
could be replaced by another, involving the comparison of two vectors 
defined at the same point, the modified equation (33.3) would be 
expected to be a tensor equation featuring a new form of derivative 
which is a tensor. This leads us quite naturally to the concept of 
parallel displacement. 

Suppose that the vector A; is displaced from the point P, at which 
it is defined, to the neighbouring point P’, without change in magnitude 
or direction, so that it may be thought of as being the same vector now 
defined at the neighbouring point. The phrase in italics has no precise 
meaning in /y as yet, for we have not defined the magnitude or the 
direction of a vector in this space. However, in the particular case 
when /y is Euclidean and rectangular axes are being employed, this 
phrase is, of course, interpreted as requiring that the displaced vector 
shall possess the same components as the original vector. But even in 
én, if curvilinear coordinates are being used, the directions of the 
curvilinear axes at the poirt P’ will, in general, be different from their 
directions at P and, as a consequence, the components of the displaced 
vector will not be identical with its components before the displace- 
ment. In /y, therefore, components of the displaced vector will be 
denoted by A;+ 5A;. This vector can now be compared with the field 
vector A;+ dA; at the same point P’. Since the two vectors are defined 
at the same point, their difference is a vector at this point, i.e. dA;— 5A; 
is a vector. The modified equation (33.3) is accordingly expected to be 
of the form 


dA;—8A; — A;,;dx » (33.4) 
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where A,.; is the appropriate replacement for A, ;. Since dx’ is an 
arbitrary vector and the left-hand member of equation (33.4) is known 
to bea vector, A;.; will, by the quotient theorem, be a covariant tensor. 
It will be termed the covariant derivative of A;. Thus, the problem of 
defining a tensor derivative has been re-expressed as the problem of 
defining parallel displacement (infinitesimal) of a vector. 

We are at liberty to define the parallel displacement of A; from P to 
P’ in any way we shall find convenient. However, to avoid confusion, 
it is necessary that the definition we accept shall be in conformity with 
that adopted in & y, which is a special case of 7 y. Suppose, therefore, 
that our “x is Euclidean and that y are rectangular Cartesian co- 
ordinates in this space. Let B; be the components of the vector field 4; 
with respect to these rectangular axes. Then 

ax! 
A; = 2, B; = an = Aj. (33.5) 
If the parallel displacement of the vector A; to the point P’ is now 
carried out, its Cartesian components B; will not change, i.e. 6B = 0. 
Hence, from the first of equations (33.5), we obtain 


a) (2) 


a y/ 
ax! ax dx* B;. (33.6) 


Substituting for B; into this equation from the second of equations 
(33.5), we find that 


84, = Ti, Ajax, (33.7) 
ay! ax! 
where ik = ax! axk dy) (33.8) 


This shows that, in &y, the 5A; are bilinear forms in the A, and dx*. 
In Sy, we shall accordingly define the 5A; by the equation (33.7), 
determining the N? quantities I™, arbitrarily at every point of S y.t 

ra Subject to the requirement that the I", are continuous functions of the 


x! and possess continuous partial derivatives to the order necessary to 
validate all later arguments. 
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This set of quantities I, is called an affinity and specifies an affine 
connection between the points of ,,. A space which is affinely con- 
nected possesses sufficient structure to permit the operations of tensor 
analysis to be carried out within it. 

For we can now write 


oA: 
dAy~8A, = “ide! ~The Aga, 
aA; 
- (A-rhai) de (33.9) 


But, as we have already explained, the left-hand member of this 
equation is a vector for arbitrary dx/ and hence it follows that 


GA; 
Ai;; = ai Pb AR (33.10) 


is a covariant tensor, the covariant derivative of A;. 

It will be observed from equation (33.10) that, if the components of 
the affinity all vanish over some region of /y, the covariant and 
partial derivatives are identical over this region. However, this will 
only be the case in the particular reference frame being employed. In 
any other frame the components of the affinity will, in general, be 
non-zero and the distinction between the two derivatives will be main- 
tained. In tensor equations which are to be valid in every frame, 
therefore, only covariant derivatives may appear, even if it is possible 
to find a frame relative to which the affinity vanishes. 

We have stated earlier that, when defining an affine connection, the 
components of an affinity may be chosen arbitrarily. To be precise, a 
coordinate frame must first be selected in “jy and the choice of the 
components of the affinity is then arbitrary within this frame. How- 
ever, when these have been determined, the components of the affinity 
with respect to any other frame are, as for tensors, completely fixed by 
a transformation law. This transformation law for affinities, we now 
proceed to obtain. 


34. Transformation of an Affinity 
The manner in which each of the quantities occurring in equation 
(33.10) transforms is known, with the exception of the affinity If. The 
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transformation law for this affinity can accordingly be deduced by 
transformation of this equation. Relative to the x-frame, the equation 
is written 


24 
Ajj = =~ TRA. (34.1) 
Since 4;, A;,; are tensors, 
ax" 
A; = gyi tn (34.2) 
axs ax* 
A;.3 = pal agi ane (34.3) 
Substituting in equation (34.1), we obtain 
ax* ax! ax" ex" 0A, &x" 5 @ 
Se x A. . 
ont agi “ss! — “Og ogl ax* * Ont ox! agi Tagg ax* or) 


Employing equation (33.10) to substitute for A,.,and cancelling a pair 
of identical terms from the two members of equation (34.4), this 
equation reduces to 

Ox af Ox J r a x _ pe OX 


~aglagh Ar = agragi Ar! Vagh A ad 


Since A, is an arbitrary vector, we can equate coefficients of A, from 
the two members of this equation to obtain 

ax” = @x® Ax! + e* x" 
ask ~ axtaxl” *° axl ax! 


(34.6) 


Multiplying both sides of this equation by 2x//ax” and using the result 


ax" axk ax . or) 
yields finally 
_ ax! ax’ ax! rp xT (34,8) 
¥ ax" axtaxs ax" ax! ax?” ; 


which is the transformation law for an affinity. 
It should be noted that, were it not for the presence of the second 
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term in the right-hand member of equation (34.8), I %; would trans- 
form as a tensor of the third rank having the covariant and contra- 
variant characteristics suggested by the positions of its indices. Thus, 
the transformation law is linear in the components of an affinity but 
is not homogeneous like a tensor transformation law. This has the 
consequence that, if all the components of an affinity are zero relative 
to one frame, they are not necessarily zero relative to another frame. 
However, in general, there will be no frame in which the components 
of an affinity vanish over a region of / y, though it will be proved that, 
provided the affinity is symmetric, it is always possible to find a frame 
in which the components all vanish at some particular point (see 
section 38). 

Suppose Ij, I'i* are two affinities defined over a region of Sy. 
Writing down their transformation laws and subtracting one from 
the other, it is immediate that 


ax* axs axt 
P-L" = Fazer Cu TD), (34.9) 


i.e. the difference of two affinities is a tensor. However, the sum of two 
affinities is neither a tensor nor an affinity. It is left as an exercise for 
the reader to show, similarly, that the sum of an affinity rk and a 
tensor Agi is an affinity. 

If Thi is symmetric with respect to its subscripts in one frame, it is 
denis in every frame. For, from equation (34.8), 


k ax* ax ax! ry OF a7 x" 
it ax’ ax) ax! ax? axt ax?” 
_ axk ax? ax’ "4 ax* ox? 
ax” axtaxt” "" ax axl agi” 
= Pk, (34.10) 


where, at the first step, we have put I, = I,. 


35. Covariant derivatives of tensors 
In this section, we shall extend the process of covariant differentiation 
to tensors of all ranks and types. 
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Consider first an invariant field V. When V suffers parallel displace- 
ment from P to P’, its value will be taken to be unaltered, i.e. 5V = 0 
in all frames. Hence 


av 
dV—8V = — dx! (35.1) 
ox 


is the counterpart for an invariant of equation (33.4). It follows that 
Vii= Vi, (35.2) 


i.e. the covariant derivative of an invariant is identical with its partial 
derivative or gradient. 

Now let B! be a contravariant vector field and A; an arbitrary co- 
variant vector. Then A,B’ is an invariant and, when parallel displaced 
from P to P’, remains unchanged in value. Thus 


5(A4;B') = 0, 
or 8A, B'+.A,;5B' = 0, 
and hence, by equation (33.7), | 
A, SBE = —I' A;dx! B', (35.3) 
But, since the A, are arbitrary, their coefficients in the two members 
of this equation can be equated to yield 
SBE = —TEB' dx’. (35.4) 
This equation defines the parallel displacement of a contravariant 


vector. The covariant derivative of the vector is now deduced as 
before: Thus 


aB* 
dB*—8B* = (F+ rps) ae (35.5) 


and since dx/ is an arbitrary vector and dB* — 5B* is then known to be 
a vector, 

OBY ea 

7 on py t PGB (35.6) 


is a tensor called the covariant derivative of B*. 
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Similarly, if Aj is a tensor field, we consider the parallel displacement 
of the invariant Aj B,C’, where B;, C’ are arbitrary vectors. Then, from 


5(4} B,C’) = 0 (35.7) 
and equations (33.7), (35.4), we deduce that 
SAi = Ih, Aidx* — Tj, Afdx*. (35.8) 


It now follows that 
i 


ry | a 
ik = ae \ Ait Ij, Ay (35.9) 


is the covariant derivative required. 

The rule for finding the covariant derivative of any tensor will now 
be plain from examination of equation (35.9), viz., the appropriate 
partial derivative is first written down and this is then followed by 
‘affinity terms’; the ‘affinity terms’ are obtained by writing down an 
inner product of the affinity and the tensor with respect to each of its 
indices in turn, prefixing a positive sign when the index is contra- 
variant and a negative sign when it is covariant. 

Applying this rule to the tensor field whose components at every 
point are those of the fundamental tensor 5), it will be found that 


Sic = Ty 85TH 8) = Te - Th = 0. (35.10) 


Thus, the fundamental tensor behaves like a constant in covariant 
differentiation. 

Finally, in this section, we shall demonstrate that the ordinary rules 
for the differentiation of sums and products apply to the process of 
covariant differentiation. 

The right-hand member of equation (35.9) being linear in the tensor 
Ai, it follows immediately that if 


Ci = Ai+ Bi, (35.11) 
then Cj.~ = Aja t+ Bk (35.12) 
Now suppose that C! = A‘B’, | (35.13) 
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Then 


ac’ 
C5. = 


= 


= (AB) + I, AGB, 


= (4+ wet) Bs +I4,B a’) A 


= Aj, ,B/+B/,, Ai, (35.14) 
which is the ordinary rule for the differentiation of a product. 


36. Covariant differentiation of relative tensors 

We consider first the parallel displacement of an invariant density U 
from the point P to the neighbouring point P’, at which it will be taken 
to become 2+ 5%. It cannot be assumed that 5% = 0 for, even if the 
two invariant densities are identical in one frame, since the transfor- 
mation law at P is different from that at P’, they will cease to be 
identical when referred to another frame. If, however, jis Euclidean 
and we restrict the choice of frames to those which are rectangular 
Cartesian, then the transformation law for the density will be 


W= Dux, (36.1) 


where D is given by equation (32.1) and, in the case of orthogonal 
transformations, can take the values + 1 and — 1 only. In this special 
case, therefore, D does not depend upon the point at which is defined 
and 5% = Ocan be valid for all rectangular Cartesian frames. We shall 
accordingly define the parallel displacement of an invariant density in 
@y, relative to rectangular Cartesian frames, to be such that the 
density suffers no change. 

Let y’, y' + dy! be the coordinates of points P, P’ relative to rectangu- 
lar Cartesian axes in @y and let 8 be an invariant density calculated 
in this frame and associated with the point P. If this density is parallel 
displaced to P’, its value remains unchanged at 8, i.e. 58 = 0. Let 
x!, x!+ dx! be the coordinates of P, P’ respectively referred to any 
other coordinate frame (not necessarily Cartesian) and let 2% be the 
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same invariant density at P calculated in this new frame. Referred to 
the new frame, let 2+ 5% be this density after it has been parallel 
displaced to P’. Then 


W= DB, (36.2) 
where D = |@y!/ax/|. Hence 
1 
su = 8(DB) = 8D.B = —SDU. (36.3) 
7) 
But 5D = EP ie (36.4) 
ox 
and thus 8u = K;Udx', (36.5) 
10D 
where i= D ox! (36.6) 


Equation (36.5) is valid for the parallel displacement of an invariant 
density in a space /y which is Euclidean. If “yn is not Euclidean, the 
parallel displacement of an invariant density will be defined by the 
equation (36.5). Since Cartesian coordinates are not available in such 
a space, the coefficients K; cannot be derived from equation (36.6) and 
must, instead, be imposed upon the space by specification at every 
point (cf. an affinity). This specification will, for the moment, be 
assumed to be performed in an arbitrary manner in any one frame and, 
when this has been done, the K; will be determined in all frames. 

Suppose now that Y is an invariant density field and let U, &%+ dU 
denote the actual values taken by the density at the points x!, x!+ dx’. 
Let U be displaced to the point x'+dx', where it assumes the value 
W-+ 5H. Then the difference between the two densities defined at the 
point x'+ dx’, is itself a density and equals 


ou 
dX — SU = (S-xim) dx’, (36.7) 
But dx! being an arbitrary vector, it follows that 
ou 
U,; = ax! K,;&U (36.8) 


is a covariant vector density. This will be termed the covariant 
derivative of U. 
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Consider now 8}, a tensor density of the second rank. Let & be any 
invariant density. Then %~! is a relative invariant of weight — 1 and 
hence 

a-1 Bi = Bi (36.9) 
is a tensor. Rewriting this equation thus, 
Bi = UBi, (36.10) 
and subjecting it to parallel displacement, we find 
5B) = 5(Q1B)), 
= U5B)+ Bsa, 
= UI}, B}—- Ti, By) dx* + BK, Udx*, 
= (Yj, Bi-T};, 85+ K;, Bi) dx*, (36.11) 
where equations (35.8) and (36.5) have been employed in the third line 
of this argument. Equation (36.11) indicates that the law of parallel 
displacement for a tensor density is identical with that for a tensor 
except for the presence of an additional term involving the coefficient 
Kx. 

If Bi is a tensor density field, its covariant derivative can now be 
found in the usual way. It is easily shown that this derivative is given 
by 


8B} 


Bi, = ak t Tin Bi — Ij, Bi-K;, B} (36.12) 


and is a tensor density. a rule for differentiating any tensor density 
should now be clear after inspection of the last equation. 

Consider, in particular, the tensor density field whose components 
at every point are e’---”. Since these components are all constant, 
their partial derivatives are zero and, by the rule, the covariant 


derivative is 
ae — Tied "4 Tel 4 eb K, el, (36.13) 
If i,j, ..., 2 is an even permutation of 1, 2, ..., N, then 
Ti,e7---" = I, (mot summed over A), 
T/,e"--" = IJ, (not summed over j), 


} (36.14) 


etc. and hence 
ern = I¥,—K,, (36.15) 


GENERAL TENSOR CALCULUS. RIEMANNIAN SPACE 107 


where summation with respect to r is intended. If i, j, ..., 2 is an odd 


permutation, then ‘ 
een, = —(7s— Ks). (36.16) 


If a pair of i, j, ... mn are the same, e.g. i= j= P, then 
Tie "4 Die itet = PP erPe.ny PP Prin 
= PP oP mM pPerPe.n 
= 0, (36.17) 


where summation with respect to P is not intended. Also, the remain- 
ing terms of the right-hand member of equation (36.13) are then zero 
and it follows that, in this case, 


et 0. (36.18) 
Equations (36.15), (36.16), (36.18) can be summarized in the form 
ef" = (I%,—K) ef". (36.19) 


e/---" having the same components in every frame and at every 
point, it is natural to expect its covariant derivative to be identically 
zero. Such a result would certainly facilitate calculations into which 
this density enters, since it could then be treated as a constant with 
respect to covariant differentiation (cf. 54). Equation (36.19) shows 
that this desirable state of affairs can easily be achieved by taking 


K, = 7, (36.20) 


in the law defining parallel displacement of densities. This we shall 
accordingly do in all future developments. The covariant derivative 
of a tensor density as determined by equation (36.12), will then read 


emi 
jk = Geet Die Bi "xe Bi— I}, Bj. (36.21) 


The parallel displacement of any relative tensor of weight W can 
now be deduced. Let € be a relative invariant of weight W and UM an 
invariant density. Then €/%” is an invariant. Denoting this by V, we 
have 

c= uy. (36.22) 
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Performing a parallel displacement to a neighbouring point, we obtain 
S€ = 6M” V) = WU" —! du, (36.23) 


since 5V = 0. Substituting for 5% from equation (36.5) (using the 
above form for K;), it is found that 


8C = WI',U” Vax* = WTi, Cdx*. (36.24) 


This is the law for parallel displacement of a relative invariant of 
weight W and it will be noted that it is identical with the law (36.5) for 
the parallel displacement of a density, except for the occurrence of 
the numerical factor W. 

The derivation of the covariant derivatives of relative tensors now 
proceeds in the same manner as for densities. The covariant derivative 
of an invariant € of weight W proves to be 


ag 
C.,.= Pe WIi, €, (36.25) 


(cf. equation (36.8)), and 
i 


, Co, : 
sk = Zt a Gj Cr — WI, C} (36.26) 


is the covariant derivative of a relative tensor Ci of weight W (cf. 
equation (36.21)). 

The covariant relative tensor of weight — 1, which has earlier been 
denoted by ej... ,, can be regarded as a field and its derivative now 
found. It will be left as an exercise for the reader to show that this 
derivative is identically zero. The method used tu derive equation 
(36.19) should be employed. 


37. The Riemann-Christoffel curvature tensor 

If a rectangular Cartesian coordinate frame is chosen in a Euclidean 
space &y and if A’ are the components of a vector defined at a point Q 
with respect to this frame, then 84! = 0 for an arbitrary small parallel 
displacement of the vector from Q. This being true for arbitrary A’, it 
follows from equation (35.4) that Ij, = 0 with respect to this frame at 
every point of &),. Suppose C is a closed curve passing through Q and 
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that A' makes one complete circuit of C, being parallel displaced over 
each element of the path. Then its components remain unchanged 
throughout the motion and hence, if A'+ 4A! denotes the vector upon 
its return to Q, 


AA‘ = 0. (37.1) 


Since 4A! is the difference between two vectors both defined at Q, it 
is itself a vector and equation (37.1) will therefore be a vector equation 
true in all frames. Thus, in @y, parallel displacement of a vector 
through one circuit of a closed curve leaves the vector unchanged. 

If, however, 4! is defined at a point Q inanaffinely connected space 
SN, not necessarily Euclidean, it will no longer be possible, in gen- 
eral, to choose a coordinate frame for which the components of the 
affinity vanish at every point. As a consequence, if A’ is parallel dis- 
placed around C, its components will vary and it is no longer per- 
missible to suppose that upon its return to Q it will be unchanged, 
ie. 4A! 4 0. We shall now calculate 4A‘ when 4! is parallel displaced 
around a small circuit C enclosing the point P having coordinates x! 
(Fig. 5) at which it is initially defined. 

Let U be any point on this curve and let x'+ &! be its coordinates, 
the £' being small quantities. V is a point on C near to U and having 
coordinates x'+ €'+ dé!. When A! is displaced from U to V, its com- 
ponents undergo a change 


8A = —Tj, Aldé*, (37.2) 


where Ti, and 4/ are to be computed at U. Considering the small 
displacement from P to U and employing Taylor’s theorem, the value 
of Ij, at U is seen to be 


yet oT bee (37.3) 


to the first order in the €’. In this expression, the affinity and its deriva- 
tive are to be computed at P. A/ in equation (37.2) represents the 
vector after its parallel displacement from P to JU, i.e. it is 


—Tia' &, (37.4) 
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where A/, A’ and I, are all to be calculated at the point P. To the first 
order in ¢! therefore, equation (37.2) may be written 


fog ort fe os 
SAi = -| Ala Tha) eae (37.5) 
U 
V 
Cc 
Fic. 5 


Integrating around C;, it will be found that 
; ; or 
AA = —Ti, A? $ att+( re Lf 2) AJ 4 é'dé*, (37.6) 
Cc oe Cc 


where the dummy indices j, r have been interchanged in the final term 
of the right-hand member of equation (37.5). 


Now $ det = Agk = 0, (37.7) 
C 
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Also $ tele) = Ate! £4) = 0, (37.8) 
Cc 


so that $ eld = — 4 ekael, (7.9) 
Cc Cc 


implying that the left-hand member of this equation is skew-sym- 
metric in / and k. Since €', dé* are vectors, it is also a tensor. Denoting . 
it by a*!, we have 


all = 46 (¢lagh— ek atl (37.10) 
Cc 
and equation (37.6) then reduces to the form 
a 
AA = (rri- a) Al ak, (37.11) 


Apart from its property of skew-symmetry, «*! is arbitrary. None- 
theless, since it is not completely arbitrary, the quotient theorem 
(section 31) cannot be applied directly to deduce that the contents of 
the bracket in equation (37.11) constitute a tensor. In fact, this expres- 
sion is not a tensor. However, it is easy to prove that, if X#j is skew- 
symmetric with respect to k, J and if Y¥, defined by the equation 


YY = X$a*, (37.12) 


is a tensor for arbitrary skew-symmetric tensors «*!, then X/J is also 
a tensor. 
To prove this, let B*' be an arbitrary symmetric tensor. Then the 
components of the tensor 
yk! = okt Bk (37.13) 


are completely arbitrary, for, assuming k < /, 
yt = ok pe. yk a — ok! 4. pel (37.14) 


and it follows that the values of y*!, y/* can be chosen arbitrarily and 
then 
aM = Hy — yh), BH = Ay +"), (37.15) 


Le., it is only necessary to fix the values of «*!, B*' in the cases k < lin 
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order that the yi! shall assume any specified values over the complete 
range of its superscripts, with the exception of the cases nee two 
superscripts are equal. If the superscripts are equal, «“!=0 and 
y*! = B*, But these 6” are also arbitrary and hence so again are the 
if ' with equal superscripts. 
Since pet is symmetric. and Xj} is skew-symmetric, 


Xp = 0. (37.16) 
Adding equations (37.12) and (37.16), we obtain therefore 
Xfy" = 4, (37.17) 


But y“ is an arbitrary tensor and hence, by the quotient theorem, Xj 
is a tensor. 

The multiplier of «*! in equation (37.11) is not skew-symmetric in 
k, l. However, it can be made so as follows: Interchange the dummy 
indices k, / in this equation to obtain 


ar 
AA! = (rte) Ad al, (37.18) 


Adding equations (37.11) and (37.18) and noting that o*! = —«!*, it 
will be found that 


: a ar 
A = a(TTi -rir eat Sit) abel (37.19) 


The bracketed expression is now skew-symmetric in k, / and hence 


ari, a 
(Me 0i— DT aH) al (37.20) 


is a tensor. A/ being arbitrary, it follows that 
al}, Oj, 
Bia = PD i-TuTet 54 Gt (37.21) 


is a tensor. It is the Riemann-Christoffel Curvature Tensor. 
Equation (37.19) can now be written 


AA! = 4Bi,, Al a*, (37.22) 
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If Bi,; is contracted with respect to the indices i and J, the resulting 
tensor is called the Ricci Tensor and is denoted by Rj. Thus 


Ry = Bhi. (37.23) 


This tensor has an important role to play in Einstein’s theory of gravi- 
tation. Since Bj,; is skew-symmetric with respect to the indices k and 
1, its contraction with respect to i and k yields only the Ricci tensor 
again in the form — Rj. However, contraction with respect to the 
indices i and j yields another second rank tensor, viz. 


Sit = Bigg = —p~ i. (37.24) 


38. Geodesic coordinates. The Bianchi identities 
If the affinity Ij, is symmetric in its subscripts, it is always possible to 
find a coordinate frame in which all components of the affinity vanish 
at any chosen point. 

We shall first choose a coordinate frame in which the chosen point 
Pis the origin and hence has coordinates x' = 0. We then transform to 
new coordinates x’ by the equations 


x! = xit hal, x) x*, (38.1) 


where the aj, are constants whose values will be chosen later and we 
shall assume, without loss of generality, that ain is symmetric with 
respect to j and k. Differentiating the equations (38.1), we find that 


ax! ae oe 
ag Sit ai, &*, 
(38.2) 
ext 
axl agk — “Uk: 
At P x' = 0 and these reduce to 
Fy . ax! ; 
~ = 8! = al. (38.3) 
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ax! ax/ ‘ 
Now aging = oF (38.4) 
and hence, at the chosen point, 
. ax! 
ax! 
or axk = 5) . (38.6) 


Substituting appropriately in equation (34.8), we calculate that the 
components of the affinity at the point P in the x'-frame are given by 


Fi = 8/858! I%,4+ 84), 
= Tij+ai;. (38.7) 


Since the affinity is symmetric, it is now possible to choose the aj, 
to satisfy 
a = —I%;. (38.8) 
The transformation (38.1) is now determined completely and, by 
equation (38.7), 
Ii, = 0 (38.9) 
as required. 

The coordinates x are said to be geodesic at the point P. If such 
coordinates are employed, it is clear that covariant and partial de- 
riva@ ves are identical at P. This enables us to simplify many arguments 
leading -to tensor equations. However, if such equations can, by this 
means, be proved valid in the geodesic frame, they are necessarily 
valid in all frames. As an example, we will derive the Bianchi Identity. 

Thus, assuming the affinity is symmetric and employing geodesic 
coordinates at the point being considered, the covariant derivative of 
equation (37.21) is simply 


Fy ; aly, aly 
Birt:m = oa PAD Diet 5 Se st). 
er, rr, 


(38.10) 


> ax” axk ax” aay! 


since the Ij, (but not their derivatives necessarily) all vanish at this 
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point. Cyclically permuting the indices k, 1, m in equation (38.10), we 
obtain 
er ri, 
Bimsk = oR ax! oxF ax”? Gam 
@2 ri e2 Ti 
i Septet: peat LL 
Bimk:1 Ax! ax™ ax! axk oe) 
Addition of equations (38.10), (38.11), (38.12), yields the following 
identity 
Bit; m+ Bjm:k+ Bink ;1 = 9. (38.13) 


But this is a tensor equation and, having been proved true in the 
geodesic frame, must be true in all frames. Also, since the chosen point 
can be any point of /y, it is valid at all points of the space. It is the 
Bianchi Identity. 


39. Metrical connection. Raising and lowering indices 

In this section we shall further particularize our space S , by suppos- 
ing it to be Riemannian. That is, we shall suppose that a ‘ distance’ or 
interval ds between two neighbouring points x’, x'+ dx! is defined by 
the equation 


ds* = g,dx' dx, (39.1) 


where the N coefficients g;; are specified in some coordinate frame at 
every point of Sy. It will be assumed, without loss of generality, that 
the g;; are symmetric. Such a relationship between all pairs of adjacent 
points is called a metrical connection and the expression (39.1) for. ds” 
is termed the metric. 

For any two neighbouring points, ds will be regarded as an in- 
variant associated with them and the g, must accordingly transform 
so that this shall be so. Since dx!dx/ is an arbitrary symmetric tensor, 
gi is symmetric and ds* is an invariant, it follows by a modified quo- 
tient theorem similar to the one proved in section 37, that gj is a 
tensor. It is called the fundamental covariant tensor. The contravariant 
tensor which is conjugate to g;; (see section 31), viz. g, is termed the 
fundamental contravariant tensor. This exists only if g = |g,| + 0, which 
we accordingly assume to be the case. 
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Let A! be any contravariant vector defined at a point of #y. Then 
gi; A/ is a covariant vector at the same point and this will be denoted 
by Aj. Thus 

A; = gyA’. (39.2) 


We shall regard the A and A; as the contravariant and covariant 
components respectively, relative to the coordinate frame in use, of 
the same vector. The process defined by equation (39.2), of converting 
the contravariant expression for a vector into its covariant expression 
is termed lowering the index. 

If Bis a covariant vector, its contravariant expression is determined 
by raising the index with the aid of the fundamental contravariant 
vector. Thus 

Bi = gi B. (39.3) 


For the notation to be consistent, it is necessary that if an index is 
first lowered and then raised, the original vector should again be 
obtained. This is seen to be the case for, if A; is formed from A’ (equa- 
tion (39.2)), the result of raising its subscript is (equation (39.3)) 


g! A; = gg), A* = 5, A* = Al, (39.4) 
where equations (31.9) have been used in the reduction. Similarly, if 
an index is first raised and then lowered, the original covariant vector 
is reproduced. 


Any index of a tensor can now be raised or lowered in the obvious 
way. Thus, if A, is a tensor, we define 
Alin = 8, Ay. (39.5) 
To allow for the possibility that indices may be raised or lowered 
during a calculation, it will be convenient to displace the subscripts to 
the right of the superscripts. It is also often helpful to keep a record 
of these operations by: placing a dot in the gap resulting from the 
raising or lowering of an index. These conventions are illustrated in 
equation (39.5). 
Suppose an index of the fundamental tensor g;; is raised. The result 
is 
gk = gig, = d*,, (39.6) 
i.e. the mixed fundamental tensor. The same tensor results when an 
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index of gis lowered. If both subscripts of g;; are raised, the result is 
gl gt gi = gl 8°; = 2”, (39.7) 
Our notation is entirely consistent, therefore, and g;;, gl Oe are taken 
to be the covariant, contravariant and mixed components respectively 
of a single fundamental tensor. 
Consider the inner product of two vectors A’, B;. We have 
A'B; = g) Aye BY, 
= g! gi, A;B*, 
= 8), A, B*, 
= Ax BE 9 
= A,B’. (39.8) 
It is clear that the dummy index occurring in the expression for an 
inner product can be raised in one factor and lowered in the other 
without affecting the result. This is obviously valid for the inner pro- 


duct of any pair of tensors. 
In &y, if we confine ourselves to rectangular Cartesian frames, 


ds* = dx'dx' (39.9) 
ee ve T a (39.10) 
= 0, i#j. 
It will now be found that g = |g,;| = 1 and that 
gi=1, i=j. 
39.1 
=0, i#j. } oa 


Hence, if A; is a vector in the space, according to our definition its 
contravariant components will be 
Ai = gi A, = Aj, (39.12) 


confirming that there is no distinction between covariant and contra- 
variant vectors in this special case of @y. 


40. Scalar products. Magnitudes of vectors 
In &y, the magnitude of the displacement vector dx' is taken to be ds 
as given by equation (39.9). In #y, the magnitude of this vector is 
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taken to be ds as given by equation (39.1). If A’ is any other contra- 
variant vector, it may be represented as a displacement vector and 
then its magnitude is the invariant A, where 


A? = g, A! Ad, (40.1) 


This equation is accordingly taken to define the magnitude of A’, 
Raising and lowering the dummy indices in equation (40.1), we 
obtain the equivalent result 


A? = gi A; A). (40.2) 


It is natural to assume that the associated vectors A;, A’ have equal 
magnitudes and hence A is also taken to be the magnitude of Aj. 
Equation (40.2) indicates how this can be calculated directly from A;. 

Since g,A/ = A; and g" A, = A’, equations (40.1), (40.2) are also 
both seen to be equivalent to the equation 


A? = A, A‘. (40.3) 

The scalar product of two vectors A, B is defined to be the invariant 
A-B = 4,B' = A'B, = g,A' B! = g! A,B). (40.4) 

It will be noted that A’ = A-A, (40.5) 


By analogy with &3, we now define the angle 9 between two vectors 
A, B to be such that 


ABcos 6 = A-B. (40.6) 
A'B; 
I.e. cos 9 = ——___—_—_,_-: 40.7 
VITA) (BB Se 
If 6 = 47, the vectors are said to be orthogonal and 
A'B; = 0. (40.8) 


41. The Christoffel symbols. Metric affinity 
At any chosen point P of #y, as is proved in algebra texts, the homo- 
geneous quadratic form defining the metric can, by a regular linear 
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transformation from the coordinates x’ to coordinates y', be reduced 
to diagonal form thus 


gydx'dx! = dy')+@’*)*+...4@"). (41.1) 


The transformation may involve complex coefficients. The metric 
may be assumed to take this simplified form over a small neighbour- 
hood of P in which the y’ will behave like rectangular Cartesian co- 
ordinates in & y. In sucha neighbourhood, therefore, the components 
of the affinity I”, will be representable in the form given by equation 
(33.8) and hence must be symmetric in their subscripts at the point P. 
This will be true at every point of 2. We have established, therefore, 
that an affinity imposed upon a Riemannian space must be symmetric 
if parallel displacement in any small Euclidean region is to be in 
accordance with the usual definition for such a region. 

Apart from the fact that it is symmetric, the affinity is otherwise 
arbitrary. However, consider the covariant derivative A;, ,. Formally, 
this can be obtained from A’ in either of two ways, (i) by lowering the 
index i and then differentiating or (ii) by differentiating and then 
lowering the index. Unless these two operations commute, so that 
either process leads to the same result, confusion will clearly arise. It 
is convenient, therefore, to define the affinity, if possible, in such a 
way that these operations are commutative. Thus, we shall require 
that 


By An = (By A); % (41.2) 


Expanding the right-hand member by differentiation of the factors of 
the product, the condition becomes 


By Al.4 = By Al e+ By;:K A, 
or gyn! = 0. (41.3) 
This is to be true for arbitrary 4/ and thus 
Si:k = 0. (41.4) 


I.e. the affinity should be chosen so that the covariant derivative of 
the metric tensor gy is identically zero. This can be done as follows: 
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Cyclically permuting the indices i, j and k in equation (41.4), we 
derive the equations 


0g;; ; 

axk me ~LiKe8yj- Vie Sir = 0, 

Og jx 

get HBr —ThiBir = O, (41.5) 
OL: 


ext Ki Sri Vike = 0. 


Adding the last pair of this set of equations and subtracting the first, 
and remembering that g,,, [;, are symmetric, it will be found that 


Sarl ij = [iA (41.6) 
1 (Og; OR; 2%; 
where [i,k = 3( . ei _ Fe), (41.7) 


[ij,k] is termed the Christoffel Symbol of the First Kind. It is not a 
tensor, but its indices invariably behave like subscripts in any formula 
in which it occurs. It is symmetric in the indices i, j. 

Multiplying both members of equation (41.6) by g* and summing 
with respect to k, it follows that 


Dj = {i*}; (41.8) 


ax axk 


{°;} is called Christoffel’s Symbol of the Second Kind. It, also, is not 
a tensor and is symmetric with respect to the indices i, }. 

It is now easy to verify that, if the affinity is determined by equation 
(41.8), the condition (41.4) is satisfied. Further, since 


gil 8k = 54, (41.10) 


by taking the covariant derivative of both members of this equation, 
we obtain 


F F) de;; 
where {8} = e“i,k] = ie" (2 nt Li 4). (41.9) 


gi g, =0. (41.11) 
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Multiplying by g¢*’ and summing with respect to k, we then find that 
g’’., = 0. (41.12) 


Thus, the covariant derivative of the fundamental contravariant 
tensor is also identically zero. It now follows that 

(g4 Aj); 4 = 89 Ajzks (41.13) 
i.e. asubscript may be raised before or after a covariant differentiation 
without affecting the result. This is clearly true for tensors of any 
rank. 

The affinity determined by equation (41.8) will be called the metric 
affinity. With this choice of affinity, all forms of the fundamental 
tensor may be treated as constants in covariant differentiation. 

In &y, if a rectangular Cartesian coordinate frame is employed, the 
gi are constants (equations (39.10)) and the Christoffel symbols are 
all identically zero. It follows, therefore, that covariant derivatives 
then reduce to ordinary partial derivatives. Such derivatives were 
proved to be tensors relative to these frames in section 11. 


42. The covariant curvature tensor 
The components of Bis are not all independent since the tensor is 
skew-symmetric in the indices k, 1. In addition, however, if the 
affinity is symmetric, it is easily verified from equation (37.21) that 
Bin + Bhy+t Bij, = 0. (42.1) 
If the affinity is metrical, by lowering the contravariant index of the 
Riemann-Christoffel tensor, a completely covariant curvature tensor 
Bijx1 is derived. This has a number of symmetry properties, one of 
which is obtained from our last equation immediately by lowering 
the index i throughout to give 
Byrit Biuyt Bix = 0. (42.2) 


Further such properties can be established by first calculating a par- 
ticular expression for this covariant tensor. 
From equation (37.21), it follows that 


Q re) 
Bijxt = Lis ta (B-OBG + axk {B- Axl { | > (42.3) 
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Multiplying equation (41.9) through by g,, and summing with respect 
to s, we obtain the result 


Sri} = SLi, Kk] = L,I. (42.4) 


I.e. lowering an index of the symbol of the second kind, yields the 
symbol of the first kind. Equation (42.3) is therefore equivalent to 


Biya = (rk, 3 -(rh 3+ ee eit *N- 


— alent dee ta + Ee 


mney Ky. = (42.5) 
Now, from equation (41.7), it follows that 


es = [ik, s]+[sk, i] (42.6) 


and hence 
re) 
Byxat = (rk, i 3 —(rh it w+ x3 “Ubi Uk,il- 


— {Dh UK, 51+ [sk, i) + (77.3, 5] + [sl, iD, 
7) 7) 
= oF Lil, i]—- Byte, —{ 7 Dik, \+ {;°, Mil, sl, 


( Pe, Pe, 2, 8 es 


2\axiaxk © axtax! axtax* axJ ax! 
+ 5b 4d — orl HS D- (42.7) 
It is now clear that Bit = — Byixr, (42.8) 
Bixi = — Bites (42.9) 
Bix = Bry. (42.10) 


Equations (42.8), (42.9) indicate that By,; is skew-symmetric in its 
first and final pair of indices. 

Also, lowering the superscript i throughout the Bianchi identity 
(38.13), we obtain 


Biyxt;m+ Biytm;k+ Biymk;1 = 9. (42.11) 
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43. Divergence. The Laplacian. Einstein’s tensor 

If the covariant derivative of a tensor field is found and then con- 
tracted with respect to the index of differentiation and any superscript, 
the result is called a divergence of the tensor. With respect to ortho- 
gonal coordinate transformations in &y, the partial and covariant 
derivatives are identical and then this definition of divergence agrees 
with that given in section 12. 

From the tensor A’,, two divergences can be formed, viz. 


div, A4, = A¥,..; and div, A4, = A, j. (43.1) 
A contravariant vector possesses one divergence only, which is an 
invariant. If the affinity is the metrical one, such a divergence is simply 
expressed in terms of ordinary partial derivatives thus: Since a 


derivative of a determinant can be found by differentiating each row 
separately and summing the results, we deduce that 


4 ik Sik ik Bik 
—" 2 
ax/ ne xi ax4 2 


Substituting for @g;,/2x/ from equation (42.6), this reduces to 


a = ge"* (ij, k1+lki,i) = 2gf/}. (43.3) 


Hence Ui) = ve. (43.4) 


Now let 4’ be a vector field. Its aa is 


aA! 
A'., = a t dA’, 


1 aA! 

- J | vex 7+A Zwvel, 
1 2 
~ A/g ox 


which is the expression required. 


— (V/A), (43.5) 
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In particular, if the vector field is obtained from an invariant V by 
taking its gradient, we have 


A; = oe (43.6) 
OV 
and hence A = aioe (43.7) 


From equation (43.5), it now follows that the divergence of this vector 
is 


1 df: OV 
divgrad V = V7V = aio vee! a} . (43.8) 
x Ox 
The right-hand member of this equation represents the form taken by 


the Laplacian of Vin a general Riemannian space. In &y, employing 
rectangular axes, the equations (39.11) are valid and 


eV 
2V= 
VV= axt oxi? (43.9) 
which is its familiar form. 
We shall now calculate the divergence of the Ricci tensor Rj, 
(equation (37.23)). 


If the metric affinity is being employed, this tensor is symmetric 
for 
Rig = Big = 8" Brgi = 8" Birk = 8" Bir 
= B’ ix, = Rigs (43.10) 
having employed equations (42.8)-(42.10). Raising either index ac- 
cordingly yields the same mixed tensor Rj. If this is contracted, an 


invariant ; 
R= Rj (43.11) 


is obtained. R is called the curvature scalar of Ry. 
The divergence of the Ricci tensor is 


Ri: m = a” Rims 
oe 2 Bhi. ms 
= git gik Batis. m: (43.12) 
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In view of equation (42.10), the Bianchi identity (42.11) can be written 
in the form 


Brty;m+ Bim kt Bmkij31 = 0 (43.13) 
and it follows that 


— og Binij:4+ Bmkij;))> 

= —g"" gI(Biyik— BmkiisD> 

= —3!*(Blyi..— Biuai;) 

= —g*(Ry~— Rig; 

= —Rh +R.) (43.14) 


10R 
2 ax! 


7 
l;m 


Thus Rim = FR}. = (43.15) 


is the divergence of the Ricci tensor. 
Consider now the mixed tensor 


Rj— 45; R. (43.16) 
| OR 
Its divergence is Ri, ;- wet 
, oR 
— Rta 
= 0. (43.17) 


This is Einstein’s Tensor. Its covariant and contravariant components 
are 


Ry— 38, R, R¥—ig' R, (43.18) 
respectively. Upon contraction, it yields the invariant 
R-—4NR = —3(N—2)R. (43.19) 


44. Geodesics 
Let C be any curve constructed in a space # y having metric (39.1) and 
let s be a parameter defined on C such that, if s, s+ ds are its values at 
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the respective neighbouring points P, P’ on C, then ds is the interval 
between these two points. If x' are the coordinates of any point P on 
C, then the curve will be defined by parametric equations 


x! = x"(s). (44.1) 


Since dx! are the components of a vector and ds is an invariant, 
dx'/ds is a contravariant vector at P. Its magnitude is, by equation 
(40.1), ee 

dx' dx! 
(cx is =) (44.2) 
and this is unity by equation (39.1). dx'/ds is termed the unit tangent 
to the curve at P, its direction being that of the displacement dx’ along 
the curve from P. 

Suppose C possesses the property that the tangents at all its points 
are parallel, i.e. the curve’s direction is constant over its whole length. 
This property is clearly quite independent of the coordinate frame 
being employed. In &3, such a curve would, of course, be a straight 
line. In #y, the curve will be called a geodesic. A geodesic is accord- 
ingly the counterpart of the Euclidean straight line in a Riemannian 
space. Suppose P, P’ are neighbouring points on a geodesic having 
coordinates x', x'+ dx! respectively. If the unit tangent at P is parallel 
displaced to P’, it will then be identical with the actual unit tangent at 
this point. Now, by equation (35.4), after parallel displacement from 
P to P’, the unit tangent has components 


ry 


dx! _[dx'\ — dx! dx! 
si | (tl Weep, Nl 2 44, 
ds * (=) ds as & oo) 
But the actual unit tangent for the point P’ has components 
dx! dx! d?x! 
(=). = a t ae (44.4) 


The vectors (44.3), (44.4) are identical provided 
dx! dx/ dx* 
ae ae da 
If these equations are satisfied at every point of the curve (44.1), it is 
a geodesic. We shall assume, in future, that the affinity is metrical. 


(44.5) 
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The N equations (44.5) are second order differential equations for 
the functions x‘(s) and their solution will involve 2N arbitrary con- 
stants. If A, B are two given points having coordinates x! = a’, x! = b! 
respectively, the 2N conditions that the geodesic must contain these 
points will, in general, determine the arbitrary constants. Hence there 
is, in general, a unique geodesic connecting every pair of points. 
However, in some cases, this will not be so. For example, the geodesics 
on the surface of a sphere (#,) are great circles and, in general, there 
are two great circle arcs joining two given points, a major arc and a 
minor arc. Also, if these points are diametrically opposed to one 
another, there is an infinity of great circle arcs connecting them. 

Since dx'/ds is everywhere a unit vector, on a geodesic 


dx! dx/ 
ee 44, 
Si As. ds (44.6) 
This must, accordingly, be a first integral of the equations (44.5). To 
show that this is the case, multiply equations (44.5) through by 

2g;,dx"/ds and sum with respect to i to obtain 


dx’ d7x i , dd dx* dx" _ 
281 gst + Bik Te Ge ds = (44.7) 
dx'd*x' d(_ dx'dx"\ dg,,dx'dx, : 
MOW. = ee ag = 5 (en =)- sada 
dx] dx* dx" dx} dx¥ dx’ 
Al ri —_—_ —_— = 7] ee 
$0 28ir! ik G5 ds ds OG os as 
= 
= (ik, A+ bk DS >>: 
ds ds 
= 2p, det dx dx” 
ax* ds ds ds’ 
I Ay? 
_ Beds! dx Gis, 


ds ds ds 
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By addition of equations (44.8) and (44.9), it will be seen that 
equation (44.7) can be expressed in the form 


d(— dx'dx/ 
5 (ee =) (44.10) 
Upon integration, there results the first integral 
dx! dx/i 
875 Wen constant. (44.11) 


The constant of integration must, of course, be taken to be unity. 
The definition of a geodesic which has been given at the beginning 
of this section cannot be applied to the class of curves for which the 
interval ds between adjacent points vanishes. For such a curve, the 
parametric representation (44.1) is not appropriate and a unit tangent 
cannot be defined. Instead, suppose that a (1-1) correspondence is set 
up between the points of the curve and the values of an invariant A in 
some interval Ay < A < Aj, so that parametric equations for the curve 
can be written 
x! = x/(). (44.12) 


It will be assumed that the derivatives dx'/dA all exist at each point of 
the curve. These derivatives constitute a contravariant vector and this 
has zero magnitude for, since ds = 0 along the curve, 


SUT dA 


This vector will be in the direction of the displacement vector along 
the curve dx’ and will be called a zero tangent to the curve. The curve 
will be termed a null geodesic if the zero tangents at all points of the 
curve are parallel. This implies that, when the zero tangent at P is 
parallel displaced to the adjacent point P’, it must be parallel to the 
zero tangent at this latter point, and since the magnitudes of these 
two vectors at P’ are the same, they will be taken to be identical. The 
condition for this to be so is found, as before, to be 

d?x! _, dxidx* 


ae GN GA 


(44.13) 


0. (44.14) 
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These are, therefore, the equations of the null geodesics. It may now 
be shown, by an argument similar to that culminating in equation 
(44.11), that a first integral of these equations is 
dx! dx! 
Su dr 
In this case the constant must be zero. 

Equation (44.5) may be put in an alternative form which is more 
convenient for particular calculations, as follows: Multiply through 
by 2g,; and sum with respect to i; the resulting equation is equivalent 
to 


= constant. (44.15) 


d dx' dg,,;dx! , axi dx* 
a —— )-2 ——— +22, j,.>— — = 9. : 
| sd a) tha aa 
Now 
Bride! _  Bridxt dx’ 
ds ds ax* ds ds 
= og; OL rk dx) dx* 
_ (Fe fe) ds ds a 
and 
. Of; 8re BK 
— —_ Dig EN SS * 
287i Li, = Lik, r] = axk t axi ax" (44 18) 
Equation (44.16) accordingly reduces to 
d[, dx'\ agydxidx* _ 
a er | ox’ ds ds ue) 


Exercises 5 | 

1. A, is a covariant tensor. If B; = A;;, prove that By is a covariant 
tensor. Deduce that, if A;; is symmetric (or skew-symmetric) in one 
frame, it is symmetric (or skew-symmetric) in all. [Hint : The equations 
Ay = Aji, Ay = — Aj are tensor equations.] 

2. (x, y, z) are rectangular Cartesian coordinates of a point P in 
é, and (r, 9, ¢) are the corresponding spherical polars related to the 
Cartesians by equations (29.3). A is a contravariant vector defined at 
P having components (A*, A”, A’) in the Cartesian frame and com- 
ponents (A’, AP, A?) in the spherical polar frame. Express the polar 
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components in terms of the Cartesian components. O1, O2, O3 are 
rectangular Cartesian axes such that P lies on O1 and O3 lies in the 
plane Oxy. If (A', A”, A>) are the components of A in this Cartesian 
frame, show that 


Al= A’, A? =rA§, A = rsinOA?. 
{Note: Assume the Cartesian axes are right-handed.] 
3. If A; is a covariant vector, verify that Bj = A ,— A, ; transforms 
like a covariant tensor. (This is curl A.) If A is the gradient of a scalar, 


verify that its curl vanishes. 
4. If Aj is a skew-symmetric covariant tensor, verify that 


Bij = Ay, nt Ajy,it Ani; 


transforms as a tensor. 

5. gy is the metric tensor of an #3. If g = |g,|, show that e”, e,;,, 

where 

ek = elk ia/g, ey, = V(8) Cus 

are tensors in the 23. A; is a skew-symmetric tensor in the space. 
Deduce that 4e!* 4 [jx is a contravariant vector whose components are 
Ap3/Vg, A3i/Vg, Ai2/+/g. In particular, taking Ay = B;C;—B,C,, 
show that the contravariant vector is the vector product of the co- 
variant vectors B;, C;, if the space is Euclidean. (If the space is not 
Euclidean, this is taken to be the definition of the vector product of 
covariant vectors.) 

6. A# is a skew-symmetric tensor in #. Show that 4e,;;, AF is a 
covariant vector whose components are A?°4/g, A?!4/g, A!24/g, 
Hence, define the vector product of two contravariant vectors. 

7. Show that 

Aj,j— 4j;1 = Ajj—Aj,i 
provided the affinity is symmetric. 

8. Show that 

Aj; jk — Ajz ig = Bij Ar+ Vig— Tj) Air 


and deduce that Bj, is a tensor and that covariant differentiations are 
commutative in a space for which Bj, = 0 and the affinity is sym- 
metric. Obtain the corresponding result for a contravariant vector A . 

9. A_ is defined at the point x! and is parallel displaced around a 


GENERAL TENSOR CALCULUS. RIEMANNIAN SPACE 131 


small contour enclosing the point. Prove that the increment in A’ 
resulting from one circuit is given by 


AA, = —4}Bij, A, 0%, 


where «/* is defined by equation (37.10). 
10. The parametric equations of a curve in /y are 
xi = x0); 
tis an invariant parameter. A tensor Aj is defined over a region con- 
taining the curve. P, P’ are neighbouring points t, + 4t on the curve 
and 4A} is defined to be the difference between the actual value of the 
tensor at P’ and the value of the tensor at P after it has been parallel 
displaced to P’. Prove that 
AA; dx* 


DA; _ ,. 4A; _ yi OX, 
Dr eae ae 
(DAi/Dt is called the intrinsic derivative of the tensor along the curve.) 

11. Verify that {,/,}, as defined by equation (41.9), transforms as an 
affinity. 

12. If Aj is symmetric, prove that Ay; ; is symmetric in i and j. 

13. Show that the number of the components of B/,; which may be 
assigned values arbitrarily is, in general, }N*(N— 1). If the affinity is 
symmetric, show that this number is $}N7(N?—1). [Hint: Use 
equation (42.1).] 

14. Show that the number of the components of Bj; which may 
be assigned values arbitrarily is N?(N?— 1)/12. [Hint: Use equations 
(42.2), (42.8), (42.9), (42.10).] 

15. By differentiating the equation 


sign = dh 
with respect to x!, show that 
agi™ ky Bik 
ax! 7 g" “ax! 


and hence that 
im 


a 
Sy tel M tet = 0. 
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Deduce that g"’,, = 0. 
16. If the affinity is the metric one, prove that 
2 


P ae 7) 
Rie = Bua = — Tali + F754 108 Vet 


: a 
+f Hy (74d Ont log Vg, 


Ski = Big = 9. 


[Hint: Employ equation (43.4).] Deduce that Rj, is symmetric. 
17. If 0, ¢ are co-latitude and longitude respectively on the surface 
of a sphere of unit radius, obtain the metric 


ds? = d6*+sin? 0d¢” 


for the surface. Show that the only non-vanishing three index symbols 
for this Z, are 


G12} = —sin@cos8, {172} = 274} = cotd. 
Show also that the only non-vanishing components of Bj;,,; are 
Byoi2 = — By221 = Boi, = —Baii2 = sin’ 


and that the components of the Ricci tensor are given by 
Ry2 = R21 = 0, Ri == 1, R22 = — sin? 0, 


Prove that the curvature scalar is given by R= —2. 

18. Employing equation (43.8), obtain expressions for V?V in 
cylindrical and spherical polars. 

19. In a certain coordinate system 


where ¢, ¥ are functions of position. Prove that Bj,1 is a function of 
# only. If ¥ = —log(a;x’) prove that 
Ry = B i = Q, 
(L.U.) 
20. In the 2, whose metric is 
24 2 492 24,2 
ds? = sane Si ce i (r > a), 
r—a (r?— a’) 
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prove that the differential equation of the geodesics may be written 


dr\? 
2 = 2,4 
@(§) rar k*r 


where k? is a constant such that k? = 1 if, and only if, the geodesic is 
null. By putting rd@/dr = tan¢, show that if the space is mapped ona 
Euclidean plane in which r, 6 are taken as polar coordinates, the 
geodesics are mapped as straight lines, the null-geodesics being 
tangents to the circle r = a. 


(L.U.) 
21. A 2-space has metric 
ds* = g4,(dx')? + g20(dx*)*. 
Prove that 
Looe J ahs A ee 
g we 2/g\ax'!\4/g ax! 2\a/g “Ox? 
(L.U.) 
22. Prove that 
, ij 1 
@) Aly = a ived y+ a oy 
(ii) X44, = 0, 


provided X¥ is skew-symmetric. Hence prove that, for any tensor A’ 
AY, = AY ji 
(L.U,) 


23. Acurve C has parametric equations 
x! = x'(1) 


and joins two points A and B. The fai of the curve is defined to be 


d dx' dx’) 5 
ace Sut dt 
Write down the Euler conditions that L should be stationary with 


respect to all small variations from C and by changing the indepen- 
dent variable in these conditions from ¢t to s, show that they are 
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identical with equations (44.5). (This provides an alternative definition 
fora geodesic.) 
24. If Ij, is a symmetric affinity, show that 


Tix = Vij, + 8) Ay + 8h A; 


is also a symmetric affinity. 
If Bix, Bi , are the Riemann-Christoffel curvature tensors relative 
to the affinities I7;,, "k respectively, prove that 


Bigy = Big + 84, Ay — 8} Aj + 8{(Ani— Ap), 


where Ai = A; A;— Aj;;- 
Hence show that if A; is the gradient of a scalar, then 


(M.T.) 


25. Prove. that the affinity transformations form a group. 
26. If D=|ax'/ax/), show that 
18D axi ax 


D ax! axk ax! ax/ 
27. Show that the transformation law for the quantities K; 
(equation (36.5)) is 
— Oxi axl arxk 
Ki = aah t ook ost ogl 
and deduce that I7,— K; is a tensor. 

28. Oblique cartesian axes are taken in a plane. Show that the 
contravariant components of a vector A can be obtained by project- 
ing a certain displacement vector on to the axes by parallels to the 
axes and the covariant components by projecting by perpendiculars 
to the axes. 

29. Define coordinates (7,4) on a right circular cone having semi- 
vertical angle « so that the metric for the surface is 


ds? = dr2+r2sin? «d¢2. 
Show that the family of geodesics is given by 


r = asec(?sin «— ), 
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where a,f are arbitrary constants. Explain this result by developing 
the cone into a plane. 
30. An &y has metric 


ds? = erdx! dx', 
where A is a function of the x'. Show that the only non-vanishing 
Christoffel symbols of the second kind are 


P\_ psp _ P\ | 
loo} = 2D» [pa] ~ Ho 


where A, = @A/ax". Deduce that 
ijfr 
= A2—4A,A 
fe re 4(N+2) P 4 rr 
and that the scalar curvature of this space is given by 
R = (N-1)e[,,, + HN—2)A,A,] 


where A,, = 82A/ax" ax". 
31. @ is the colatitude and ¢ is the longitude on a unit sphere, so 
that the metric for the surface is 


ds? = d@2+sin? 0 d¢?. 


The covariant vector A; is taken with initial components (X, Y) and is 
carried, by parallel displacement, along an arc of length sin « of the 
circle 6 = «. Show that the components of A; attain the final values 


A; = Xcos(¢cos«)+ Ycosec «sin (¢cos a), 
Az = —Xsinasin(dcos«)+ Ycos(¢cos «). 


Verify that the magnitude of the vector A; is unaltered by the displace- 
ment. 
32. An &; has metric 


ds? = Adr2+ r2(d62 + sin? 0d¢2), 


where A is a function of r alone. Show that, along the geodesic for 
which @ = 47, d0/ds = 0 at s = 0, 


= f mnay 


where r = bsec yb. Interpret this result geometrically when A = 1. 
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33. y' @ =1,2,3,4) are rectangular cartesian coordinates in &,. 
Show that 


y! = Roos 8, 

y? = Rsin 8cos ¢, 

y>? = Rsin Osindcos yz, 
y* = Rsin Osin¢gsin yp, 


are parametric equations of a hypersphere of radius R. If (6, ¢d, w) are 
taken as coordinates on the hypersphere, show that the metric for 
this Z; is 
ds* = R?[d62+ sin? 0(dg2 + sin? ¢ dib?)). 
Deduce that in this #3, 
By212 = R’sin?6, B 323 = R?sin4 @sin? ¢, 

B3131 = R?sin? Osin2 g, 
all other distinct components being zero. Hence show that 

Bit = K( Six j1- Fukjx) 


where K = 1/R?. (This is the condition for the space to be of constant 
Riemannian curvature K.) 
34. An &, has metric 


ds* = sech? y(dx? + dy). 


Find the equation of the family of geodesics. 

6, ¢ are colatitude and longitude respectively on the surface of a 
sphere of unit radius. Mercator’s projection is obtained by plotting 
x,y as rectangular cartesian coordinates in a plane, taking 


x=¢, y= logcot4é. 


Calculate the metric for the spherical surface in terms of x and y 
and deduce that the great circles are represented by the curves 


sinh y = «sin(x+), 


where a, B are parameters, in Mercator’s projection. 


CHAPTER 6 
General Theory of Relativity 


45. Principle of equivalence 

The special theory of relativity rejects the Newtonian concept of a 
privileged observer, at rest in absolute space, and for whom physical 
laws assume their simplest form and assumes, instead, that these laws 
will be identical for all members of a class of inertial observers in 
uniform translatory motion relative to one another. Thus, although 
the existence of a single privileged observer is denied, the existence of 
a class of such observers is accepted. This seems to imply that, if 
all matter in the universe were annihilated except for a single experi- 
menter and his laboratory, this observer would, nonetheless, be able 
to distinguish inertial frames from non-inertial frames by the special 
simplicity which the descriptions of physical phenomena take with 
respect to the former. The further implication is, therefore, that 
physical space is not simply a mathematical abstraction which it is 
convenient to employ when considering distance relationships be- 
tween material bodies, but exists in its own right as a separate entity 
with sufficient internal structure to permit the definition of inertial 
frames. However, all the available evidence suggests that physical 
space cannot be defined except in terms of distance measurements 
between physical bodies. For example, sucha space can beconstructed 
by setting up a rectangular Cartesian coordinate frame comprising 
three mutually perpendicular rigid rods and then defining the co- 
ordinates of the point occupied by a material particle by distance 
measurements from these rods in the usual way. Physical space is, 
then, nothing more than the aggregate of all possible coordinate 
frames. A claim that physical space exists independently of distance 
measurements between material bodies, can only be substantiated if 
a precise statement is given of the manner in which its existence can 
be detected without carrying out such measurements. This has never 
been done and we shall assume, therefore, that the special properties 
possessed by inertial frames must be related in some way to the 
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distribution of matter within the universe and that they are not an 
indication of an inherent structure possessed by physical space when 
it is considered apart from the matter it contains. This line of argument 
encourages us to expect, therefore, that, ultimately, all physical laws 
will be expressible in forms which are quite independent of any co- 
ordinate frame by which physical space is defined, i.e. that physical 
laws are identical for all observers. This is the general principle of 
relativity. This does not mean that, when account is taken of the 
actual distribution of matter within the universe, certain frames will 
not prove to be more convenient than others. When calculating the 
field due to a distribution of electric charge, it simplifies the calcula- 
tions enormously if a reference frame can be employed relative to 
which the charge is wholly at rest. However, this does not mean that 
the laws of electromagnetism are expressible more simply in this 
frame, but only that this particular charge distribution is then des- 
cribed more simply. Similarly, we shall attribute the simpler forms 
taken by some calculations when carried out in inertial frames, to the 
special relationship these frames bear to the matter present in the 
universe. Fundamentally, therefore, all observers will be regarded as 
equivalent and, by employing the same physical laws, will arrive at 
identical conclusions concerning the development of any physical 
system. 

The main difficulty which arises when we try to express physical 
laws so that they are valid for all observers is that, if test particles are 
released and their motions studied from a frame which is being 
accelerated with respect to an inertial frame, these motions will not 
be uniform and this fact appears to set such frames apart from in- 
ertial frames as a special class for which the ordinary laws of motion 
do not apply. However, by a well-known device of Newtonian 
mechanics, viz. the introduction of inertial forces, accelerated frames 
can be treated as though they were inertial and this suggests a way out 
of our difficulty. Thus suppose a space rocket, moving in vacuo, is 
being accelerated uniformly by the action of its motors. An observer 
inside the rocket will note that unsupported particles experience an 
acceleration parallel to the axis of the rocket. Knowing that the 
motors are operating, he will attribute this acceleration to the fact 
that his natural reference frame is being accelerated relative to an 
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inertial frame. However he may, if he prefers, treat his reference frame 
as inertial and suppose that all bodies within the rocket are being sub- 
jected to inertial forces acting parallel to the rocket’s axis. If f is the 
acceleration of the rocket, the appropriate inertial force to be applied 
to a particle of mass m is — mf. Similarly, if the rocket’s motors are 
shut down but the rocket is spinning about its axis, an observer within 
the rocket will again note that free particles do not move uniformly 
relative to his surroundings and he may again avoid attributing this 
phenomenon to the fact that his frame is not inertial, by supposing 
certain inertial forces (viz. centrifugal and Coriolis forces) to act upon 
the particles. Now it is an obvious property of each such inertial 
force that it must cause an acceleration which is independent of the 
mass of the body upon which it acts, for the force is always obtained 
by multiplying the body’s mass by an acceleration independent of the 
mass. This property it shares with a gravitational force, for this also 
is proportional to the mass of the particle being attracted and hence 
induces an acceleration which is independent of this mass. This inde- 
pendence of the gravitational acceleration of a particle and its mass 
has been checked experimentally with great accuracy by E6tvés. If, 
therefore, we regard the equivalence of inertial and gravitational 
forces as having been established, inertial forces can be thought of as 
arising from the presence of gravitational fields. This is the Principle 
of Equivalence. By this principle, in the case of the uniformly acceler- 
ated rocket, the observer is entitled to neglect his acceleration relative 
to an inertial frame, provided he accepts the existence of a uniform 
gravitational field of intensity —f parallel to the axis of the rocket. 
Similarly, the observer in the rotating rocket may disregard his motion 
and accept, instead, the existence of a gravitational field having 
such a nature as to account for the centrifugal and Coriolis forces. 

By appeal to the principle of equivalence, therefore, an observer 
employing a reference frame in arbitrary motion with respect to an 
inertial frame, may disregard this motion and assume, instead, the 
existence of a gravitational field. The intensity of this field at any point 
within the frame will be equal to the inertial force per unit mass at the 
point. By this device, every observer becomes entitled to treat his 
reference frame as being at rest and all observers accordingly become 
equivalent. However, the reader is probably still not convinced that 
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the distinction between accelerated and inertial frames has been effec- 
tively eliminated, but only that it has been concealed by means of a 
mathematical device having no physical significance. Thus, he may 
point out that the gravitational fields which have been introduced to 
account for the inertial forces are ‘fictitious’ fields, which may be com- 
pletely removed by choosing an inertial frame for reference purposes, 
whereas ‘real’ fields, such as those due to the Earth and Sun, cannot be 
so removed. He may further object that no physical agency can be held 
responsible for the presence of a ‘fictitious’ field, whereas a ‘real’ field 
is caused by the presence of a massive body. These objections may be 
met by attributing such ‘fictitious’ fields to the motions of distant mas- 
ses within the universe. Thus, if an observer within the uniformly ac- 
celerated rocket takes himself to be at rest, he must accept as an observ- 
able fact that all bodies within the universe, including the galaxies, 
possess an additional acceleration of —f relative to him and to this 
motion he will be able to attribute the presence of the uniform gravita- 
tional field which is affecting his test particles. Again, the whole uni- 
verse will be in rotation about the observer who regards himself and 
his space-ship as stationary when it is in rotation relative to an inertial 
frame. It is this rotation of the masses of the universe which we shall 
hold responsible for the Coriolis and centrifugal gravitational fields 
within the rocket. But, in addition, these ‘inertial’ gravitational fields 
will account for the motions of the galaxies as observed from the non- 
inertial frame. Thus, for the observer within the uniformly accelerated 
rocket a uniform gravitational field of intensity —f extends over the 
whole of space and is the cause of the acceleration of the galaxies; for 
the observer within the rotating rocket, the resultant of the centrifugal 
and Coriolis fields acting upon the galaxies is just sufficient to account 
for their accelerations in their circular orbits about himself as centre 
(the reader should verify this, employing the results of Exercise 1, 
Chapter 1). On this view, therefore, inertial frames possess particu- 
larly simple properties only because of their special relationship to 
the distribution of mass within the universe. In much the same way, 
the electromagnetic field due to a distribution of electric charge takes 
an especially simple form when described relative to a frame in which 
all the charges are at rest (assuming such exists). If any other frame is 
employed, the field will be complicated by the presence of a magnetic 
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component arising from the motions of the charges. However, this 
magnetic field is not considered imaginary because a frame can be 
found in which it vanishes, whereas for certain magnetic fields such a 
frame cannot be found. The laws of electromagnetism are taken to be 
valid in all frames, though it is conceded that, for solving particular 
problems, a certain frame may prove to be pre-eminently more con- 
venient than any other. Neither, therefore, should the centrifugal and 
Coriolis fields be dismissed as imaginary solely because they can be 
removed by proper choice of a reference frame, although it may be 
convenient to make such a choice of frame when carrying out par- 
ticular computations. In short, the general principle of relativity can 
be accepted as valid and, at the same time, the existence of the inertial 
frames accounted for by the simplicity of the motions of the polactie 
masses with respect to these particular frames. 

If it is accepted that the existence of inertial frames is bound up with 
the large-scale distribution of matter within the universe, then it fol- 
lows that the inertia possessed by a body, which causes it to move 
uniformly in such a frame, is also a consequence of this matter 
distribution. This is Mach’s Principle, viz. that mass is induced in a 
body by the presence of distant matter in the universe. 

The previously unexplained identity of inertial and gravitational 
masses is easily deduced as a consequence of the principle of equiv- 
alence. For, consider a particle of mass m which is being observed 
from a non-inertial frame. A gravitational force equal to the inertial 
force will be observed to act upon this body. This force is directly pro- 
portional to the inertial mass m. But, by the principle of equivalence, 
all gravitational forces are of the same nature as this particular force 
and will, accordingly, be directly proportional to the inertial masses of 
the bodies upon which they act. Thus the gravitational ‘charge’ of a 
particle, measuring its susceptibility to the influences of gravitational 
fields, is identical with its inertial mass and the identity of inertial and 
gravitational masses has been explained in a straightforward and 
convincing manner. 


46. Metric in a gravitational field 
Suppose that an inertial frame has been established in a region of 
space where there are no local gravitational influences and that a 
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material plane is set rotating with angular velocity w relative to the 
frame about an axis perpendicular to it and fixed in the inertial frame. 
An observer O, moving with the plane, is entitled to consider himself as 
at rest in the presence of a gravitational field which will account for the 
centrifugal and Coriolis forces. Suppose O identifies a large number of 
points on the plane which he measures with a standard rod to be at the 
same distance r from the axis of rotation and, by this means, constructs 
a circle. If, now, O lays his measuring rod repeatedly along a radius of 
this circle to measure its length r and this operation is watched by an 
observer O’ who is stationary in the inertial frame, O’ will agree with 
the length found by.O, for the rod will only move laterally during the 
process of measurement and hence will suffer no Fitzgerald contrac- 
tion of its length. O’ will therefore agree with O that all the radii are 
of equal length and hence that the figure constructed is a circle. How- 
ever, if O now lays his rod along the circumference of the circle (we 
assume the dimensions of the rod are small by comparison with those 
of the circle) and measures its length to be /, O’ will disagree with this 
measurement for the reason that, for him, throughout the measuring 
process the rod will have a speed wr along its length and hence will 
have contracted by the factor (1 — w?r*/c*)'/?. Allowing for this con- 
traction, O’ will assert the length of the circumference to be 


I — w r?/c*)'/?, (46.1) 


But O’ is employing an inertial frame in which the geometry is known 
to be Euclidean. It follows that 


Ka —w?r?/c?)!/? = 2ar (46.2) 


2nr 


err 


This last equation implies that for O the ratio //2r is not 7, but a 
number greater than 7 and the larger the radius of his constructed 
circle, the larger this ratio will become. By direct measurement, there- 
fore, O will be able to ascertain that the geometry of figures drawn on 
the plane is not Euclidean. We conclude that, in the presence of a 
gravitational field of the centrifugal—Coriolis type, the geometry of 
space is not Euclidean. 


and hence that l (46.3) 
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By the principle of equivalence, the conclusion which has just been 
reached concerning the non-Euclidean nature of space in which there 
is present a gravitational field of the centrifugal—Coriolis type, must 
be extended to all gravitational fields. However, in the case of a field 
such as that which surrounds the Earth, it will not be possible (as it is 
for the centrifugal—Coriolis field) to find an inertial frame of reference 
relative to which the field vanishes and for which the spatial geometry 
is Euclidean. Such a field will be termed irreducible. Even in an irre- 
ducible field, however, a frame can always be found which is inertial 
for a sufficiently small region of space and a sufficiently small time 
duration. Thus, within a space-ship which is not rotating relative to 
the extra-galactic nebulae and which is falling freely in the Earth’s 
gravitational field, free particles will follow straight line paths at con- 
stant speed for considerable periods of time and the conditions will be 
inertial. A coordinate frame fixed in the ship will accordingly simulate 
an inertial frame over a restricted region of space and time and its 
geometry will be approximately Euclidean. 

Since a rectangular Cartesian coordinate frame can be set up only 
in a space possessing a Euclidean metric, this method of specifying 
the relative positions of events must be abandoned in an irreducible 
gravitational field (except over small regions as has just been ex- 
plained). Instead, it will be assumed that each physical event is 
allotted three space coordinates (£1, £7, £3) according to any con- 
venient scheme. It will be necessary in any particular case only to 
describe the procedure whereby these coordinates are to be established 
for any event. They might, for example, be determined by radar tech- 
niques. Thus radar transmitters could, in principle, be set up at three 
widely separated points and the electromagnetic pulses generated by 
them reflected back to their sources by the event in question. The time 
intervals between transmission of a pulse and reception of its ‘echo’ 
at the three stations, would then be suitable coordinates of the type 
we are considering for the event. All that is necessary is that there 
should exist a correspondence between points in space and triads of 
real numbers (£', £7, €*) such that, to each point there corresponds a 
unique triad, and to each triad there corresponds a unique point. 
Again, clocks will be supposed distributed throughout space so that 
to each event a time ¢* can be allocated, namely, the time indicated by 
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the clock situated at the event. It will be assumed that, whatever system 
is adopted for fixing the coordinates é , a series of contiguous events, 
such as the positions occupied by a particle at successive instants of 
time, will have coordinates which will vary continuously along the 
series and which will correspond to a continuous curve in £-space. 
This implies that adjacent clocks must be synchronized and must run 
at the same not necessarily constant rate, but no special procedure 
for the synchronization of distant clocks need be laid down. It follows 
that, as in special relativity theory, two distant events may be simul- 
taneous according to one system of time reckoning, but not so 
according to another. 

We shall now further generalize the coordinates allocated to an 
event. Let x! (i = 1,2, 3,4) be any functions of the €' such that, to each 
set of values of the £' there corresponds one set of values of the x , and 
conversely. We shall write 

x =e ee). (46.4) 
Then the x‘, also, will be accepted as coordinates, with respect to a 
new frame of reference, of the event whose coordinates were previ- 
ously taken to be the € . It should be noted that, in general, each of 
the new coordinates x! will depend upon both the time and the position 
of the event, i.e., it will not necessarily be the case that three of the 
coordinates x! are spatial in nature and one is temporal. All possible 
events will now be mapped upon a space 4, so that each event is 
represented by a point of the space and the x! will be the coordinates 
of this point with respect to a coordinate frame. /, will be referred to 
as the space-time continuum. 

It has been remarked that, in any gravitational field, it is always 
possible to define a frame relative to which the field vanishes over a 
restricted region and which behaves as an inertial frame for events 
occurring in this region and extending over a small interval of time. 
Suppose, then, that such an inertial frame S is found for two con- 
tiguous events. Any other frame in uniform motion relative to S will 
also be inertial for these events. Observers at rest in all such frames 
will be able to construct rectangular Cartesian axes and synchronize 
clocks in the manner described in Chapter 1 and hence measure the 
proper time interval dr between the events. If, for one such observer, 
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the events at the points having rectangular Cartesian coordinates 
(x,y,z), (x+dx,y+dy,z+dz) occur at the times t, t+ dt respectively, 
then 


dr? = di? (dx? + dy?+ dz?) (46.5) 


The interval between the events ds will be defined by 
ds? = —c*dr? = dx? +dy*+dz’—c*dt*. - (46.6) 


The coordinates (x, y, z, f) of an event in this quasi-inertial frame, will 
be related to the coordinates x! defined earlier, by equations 


x = x(x!, x, x3, x’), etc. (46.7) 
ox 1; 
and hence dx = Ai , etc. (46.8) 


Substituting for dx, dy, dz, dt in equation (46.6), we obtain the result 
ds* = g,dx' dx’, (46.9) 


determining the interval ds between two events contiguous in space- 
time, relative to a general coordinate frame valid for the whole of 
space-time. The space-time continuum can accordingly be treated asa 
Riemannian space with metric given by equation (46.9). 


47. Motion of a free particle in a gravitational field 

In a region of space which is at a great distance from material bodies, 
rectangular Cartesian axes Oxyz can be found constituting an inertial 
frame. If time is measured by clocks synchronized within this frame 
and moving with it, the motion of a freely moving test particle relative 
to the frame will be uniform. Thus, if (x,y,z) is the position of such a 
particle at time tf, its equations of motion can be written 


es ee ee eNO: (47.1) 


Let ds be the interval between the event of the particle arriving at the 
point (x, y, z) at time ¢ and the contiguous event of the particle arriving 
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at (x+dx,y+dy,z+dz) at t+dt. Then ds is given by equation (46.6) 
and, if v is the speed of the particle, it follows from this equation that 


ds = (v?—c?)'!? at, (47.2) 


Since v is constant, it now follows that equations (47.1) can be ex- 
pressed in the form 


—= = — = — = 0. (47.3) 


Also, from equation (47.2), it may be deduced that 


d*t 

qa > 0. (47.4) 
Equations (47.3) and (47.4) determine the family of world-lines of free 
particles in space-time relative to an inertial frame. 

Now suppose that any other reference frame and procedure for 
measuring time is adopted in this region of space, e.g. a frame which 
is in uniform rotation with respect to an inertial frame might be 
employed. Let (x!,x?,x3,x*) be the coordinates of an event in this 
frame. The interval between two contiguous events will then be given 
by equation (46.9). If an observer using this frame releases a test 
particle and observes its motion relative to the frame, he will note that 
it is not uniform or even rectilinear and will be able to account for this 
fact by assuming the presence of a gravitational field. He will find that 
the particle’s equations of motion are 

2 i jayk 
Ate, jae ll (47.5) 


This must be the case for, as shown in section 44, this is a tensor 
equation defining a geodesic and valid in every frame if it is valid in 
one. But, in the xyzt-frame, the g, are all constant and the three 
index symbols vanish. Hence, in this frame, the equations (47.5) 
reduce to the equations (47.3), (47.4) and these are known to be true 
for the particle’s motion. We have shown, therefore, that the effect 
of a gravitational field of the reducible variety upon the motion of a 
test particle can be allowed for when the form taken by the metric 
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tensor gj, of the space-time manifold is known relative to the frame 
being employed. This means that the g,; determine, and are determined 
by, the gravitational field. 

The ideas of the previous paragraph will now be extended to regions 
of space where irreducible gravitational fields are present. It has been 
pointed out that, for any sufficiently small region of such space and 
interval of time, an inertial frame can be found and consequently the 
paths of freely moving particles will be governed in such a small region 
by equations (47.5). It will now be assumed that these are the equations 
of motion of free particles without any restriction, i.e. that the world- 
line of a free particle is a geodesic for the space-time manifold or that 
the world-line of a free particle has constant direction. This appears 
to be the natural generalization of the Galilean law of inertia whereby, 
even in an irreducible gravitational field, a particle’s trajectory 
through space-time is the straightest possible after consideration has 
been given to the intrinsic curvature of the continuum. It will then 
follow that the motions of particles falling freely in any gravitational 
field can be determined relative to any frame when the components 
gy of the metric tensor for this frame are known. Thus the gy will 
always specify the gravitational field observed to be present in a frame 
and the only distinction between irreducible and reducible fields will 
be that, for the latter it will be possible to find a coordinate frame in 
space-time for which the metric tensor has all its components zero 
except 

f1=82=83=1 gay = —C’, (47.6) 
whereas for the former this will not be possible. 


48. Einstein’s law of gravitation 

According to Newtonian ideas, the gravitational field which exists in 
any region of space is determined by the distribution of matter. This 
suggests that the metric tensor of the space-time manifold, which has 
been shown to be closely related to the observed gravitational field, 
should be calculable when the matter distribution throughout space- 
time is known. We first look, therefore, for a tensor quantity describ- 
ing this matter distribution with respect to any frame in space-time 
and then attempt to relate this to the metric tensor. The energy— 
momentum tensor 7;;, defined in Chapter 4 with respect to an inertial 
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frame, immediately suggests itself. Both matter and electromagnetic 
energy contribute to the components of this tensor but since, accord- 
ing to the special theory, mass and energy are basically identical, it is 
to be expected that all forms of energy, including the electromagnetic 
variety, will contribute to the gravitational field. 

Since the energy-momentum tensor has been defined in inertial 
frames only, this definition must now be extended to apply to a gen- 
eral coordinate frame in space-time. This can be carried out thus: A 
rectangular Cartesian inertial frame can beestablished for the neigh- 
bourhood of any point of a gravitational field and valid for a short 
time duration. Corresponding to this frame and its associated clocks, 
rectangular Cartesian axes Oy! y*y*y* can be constructed in the 
region surrounding the corresponding point P of space-time. The 
transformation equations relating the coordinates y’ of an event to its 
coordinates x! with respect to any other coordinate frame can now be 
found. Then, if Tf are the components of the energy-momentum 
tensor in the y-frame at the point P, its components in the x-frame at 
this point can be determined from the appropriate tensor transforma- 
tion equations. Thus, the covariant energy-momentum tensor will 
have components 7;, in the x-frame given by 


r Ss 
dy” Oy" (0) 


i = ax! ax! rs ° (48.1) 


Since covariant and contravariant tensors are indistinguishable with 
respect to rectangular Cartesian axes, T© can also be taken to be the 
components of a contravariant tensor in the y-frame and the com- 
ponents of this tensor in the x-frame will then be given by the equation 


j 
T! = — — 7), (48.2) 


Similarly, the components of the mixed energy~momentum tensor are 
given by 


Ti = — —T® (48.3) 


These transformations can be carried out at every point of space-time, 
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thus generating for the x-frame an energy-momentum tensor field 
throughout the continuum. 
Consider the tensor equation 


TY, = 0. (48.4) 


Expressed in terms of the coordinates y! at any point of space-time, 
this simplifies to 

T{?) = 0, (48.5) 
which is equation (28.18). Being valid in one frame, therefore, equation 
(48.4) is true for all frames. Thus, the divergence of the energy- 
momentum tensor vanishes. If, therefore, this tensor is to be related to 
the metric tensor g;,, the relationship should be of such a form that it 
implies equation (48.4). Now 


g!., = 0, (48.6) 
by equation (41.12) and hence, a fortiori, 

g! = 0. (48.7) 
The law TY = dg’, (48.8) 


where A is a universal constant, would accordingly be satisfactory in 
this respect. However, over a region in which matter and energy were 
absent so that T4 = 0, this would imply that 


g/=0, (48.9) 


which is clearly incorrect. Further, according to Newtonian theory, 
if w is the density of matter, the gravitational field can be derived from 
a potential function V satisfying the equation 


V7 V = 4rryp, (48.10) 


where y is the gravitational constant. The new law of gravitation 
which is being sought must include equation (48.10) as an approxi- 
mation. But, as appears from equation (28.12), 744 involves pu and it 
seems reasonable, therefore, to expect that the other member of the 
equation expressing the new law of gravitation will provide terms 
which can receive an approximate interpretation as V* V. This implies 
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that second order derivatives of the metric tensor components will 
probably be present. We therefore have a requirement for a second 
rank contravariant symmetric tensor involving second order deriva- 
tives of the g,, and of vanishing divergence to which T can be assumed 
proportional. Einstein’s tensor (43.18) possesses these characteristics 
and consequently we shall put 


RY—4e4R = —xT", (48.11) 


where « is a constant of proportionality which must be related to y 
and which we shall later prove to be positive. Equation (48.11) 
expresses Ejinstein’s Law of Gravitation; by lowering the indices suc- 
cessively, it may be expressed in the two alternative forms 


Rj—48}R = —xTi, (48.12) 
Ry—teyR = —KTy. (48.13) 

If equation (48.12) is contracted, it is found that 
R=kxT, (48.14) 


where T = T!. It now follows that Einstein’s law of gravitation can 
also be expressed in the form 


Ry = «GgyT— Ty), (48.15) 


with two other forms obtained by raising subscripts. 
Since the divergence of g” vanishes, a possible alternative to the 
law (48.11) is 
RY—he4 R+Ag! = —KT, (48.16) 


where A is a constant. The law (48.11) gives results which agree with 
observation over regions of space of galactic dimensions, so that it is 
certain that, even if A is not zero, it is exceedingly small. However, the 
extra term has entered into some cosmological investigations. It will 
be disregarded in later sections of this book. 


49. Acceleration of a particle in a weak gravitational field 

In a gravitational field, such as the one due to the Earth, the geometry 
of space is not Euclidean and no truly inertial frame exists. In spite of 
this, we experience no practical difficulty in establishing rectangular 
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Cartesian axes Oxyz at the Earth’s surface relative to which for all 
practical purposes the geometry is Euclidean and the behaviour of 
electromagnetic systems is indistinguishable from their behaviour in 
an inertial frame. It must be concluded, therefore, that such a gravi- 
tational field is comparatively weak and hence that, with respect to 
such axes and their associated clocks, the space-time metric will not 
differ greatly from that given by equation (46.6). Putting 


xi=x, P=y, 2=2, x‘ = ict, (49.1) 
in terms of the x' the metric will be given by 
ds? = dx! dx! (49.2) 


approximately. With respect to the x'-frame, it will accordingly be 
assumed that 
gy = Sythy, (49.3) 
where the 5,, are Kronecker deltas and the hy are small by comparison. 
Consider a particle moving in a weak gravitational field whose 
metric tensor is given by equation (49.3). The contravariant metric 
tensor will be given by an equation of the form 


gl = s/+k, (49.4) 
where the k! are of the same order of smallness as the hy. Then, since 


[i,k] = 3( Se +— ax 7 Bk (49.5) 
it follows that, to a first approximation, 
oh oh oh 
kr jk ik Oy 
{*} = 8", = 5(F txi ~ Oxk mt): (49.6) 


The equations of motion of the particle can now be written down as 


at (47.5). 
By equation (47.2) 
dx! = dx! dt 


where v is the particle’s velocity in the quasi-inertial frame. Hence, 
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if the particle is ee in the frame at the instant under con- 
sideration, 


dx! 
* = (0,1), (49.8) 
and the equations of motion (47.5) reduce to the form 
d*x i 
“ds? + {4 ‘a a (49.9) 
correct to the first order in the hy. Substituting from equation (49.6), 
this is seen to be equivalent to 
d*x' 1 ahig aha 
dst 2 dx! ax a 


Differentiating equation (49.7) with respect to s and making use of 
equation (47.2), we obtain 


d? x! du 
<r = (v?—¢*)=! (3. 0) - v7 wo) 2@, ic) (49.11) 
-and, when v = 0, this reduces to 
d? x! 1 /dv 
7 i -3(4.0). (49.12) 


From equations (49.10), (49.12), we deduce that the components of 
the acceleration of the stationary particle in the directions of the 


rectangular axes are 
1 Ohas4 Ohy; 
me {Gere eat) 


for i= 1, 2, 3. Reverting to the original coordinates (x, y, z, t), these 
components are written 


12hg4 i dh 
—<¢? (544+ ; Fa) , etc. (49.14) 


Hence, if the field does not vary with the time, the acceleration vector 
is 
— grad (4c h4,). (49.15) 
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But, if Vis the Newtonian potential function for the field, this accelera- 
tion will be — grad V. It follows that, for a weak field, a Newtonian 
scalar potential V exists and is related to the space-time metric by the 
equation 

V = 4ce7hgy. (49.16) 
Alternatively, we can write 


2V 
844 = 1+ aE (49.17) 


50. Newton’s law of gravitation 
In this section it will be shown that Newton’s Law of Gravitation may 
be deduced from Einstein’s Law in the normal case when the gravi- 
tational field’s intensity is weak and the matter distribution is static. 
First consider the form taken by the Riemann-Christoffel tensor in 
the space-time of a weak field. In the x/-frame, the metric tensor is 
given by equation (49.3) and the Christoffel three-index symbols by 
equation (49.6). If products of the A, are to be neglected, equation 
(37.21) shows that 


tea ed af 
Bia = salib-satia (50.1) 
approximately. Hence the Ricci tensor is given by 
7) 7) 
= i i 
Rn = ralid- patra 


_1a@ pit hy _ oh 


~ 2 ax* ax/ ax! 
~ 92 ax! “axd axe 


2 5; a hu “thy * hy “tay (50.2) 


axl ax” axt ax! axtax* ax! ax] 
In particular, putting j = k = 4, we find that 
1( ah ah ah 
Ru = | seagate = ree (50.3) 


2) ax4 ax4 © axtax! ~ ax! ax4 
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If the matter distribution is static in the quasi-inertial frame being 
employed, the A,; will be independent of t and equation (50.3) reduces 
to 

Ryg = $V7 haa, (50.4) 


where V? = @?/ax*+ 0?/ay*+ 87/02”. If V is the Newtonian potential 
for the field, equation (49.16) now shows that 


Ry = 5V7 V. (50.5) 


Assuming that no electromagnetic field is present, the overall energy— 
momentum tensor for the mass distribution will be given in the 
quasi-inertial frame by the equation 


where 9,; is determined approximately by equation (28.10). Since the 
distribution is static, its 4-velocity at every point is (0, ic) and hence 
all components of Tj, with the exception of 744, are zero. In this case, 


144 ea ¢ 00 (50.7) 


where joo, for zero velocity of matter, is the ordinary mass density. 
Also 
T = Ti = Ty = Tyg = —C7 U0. (50.8) 


The 44-component of Einstein’s gravitation law in the form of 
equation (48.15) can now be expressed approximately 


1 
av V= }uc? L090, 


or V2V = 3xc* p99. (50.9) 


This is the Poisson equation (48.10) of classical Newtonian theory, 
provided we accept 


c= “. (50.10) 


This specifies x in terms of the gravitational constant. 
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51. Metrics with spherical symmetry 
When a change is made in the space-time coordinate frame from 
coordinates x! to coordinates x , the metric tensor gj will change to 
8, by the law of transformation of a covariant tensor. In general, the 
gy Will be functions of the x! and the 2, will be functions of the X', but 
it will not usually be the case that the 7, are the same functions of the 
-*barred’ coordinates that the g;; are of the ‘unbarred’ coordinates. 
Le., the functions g,(x") are not form invariant under general co- 
ordinate transformations. However, insome special cases, it is possible 
for these functions to be form invariant under a whole group of 
transformations and we shall study such a case in this section. 

In a gravitational field, the geometry can only be quasi-Euclidean 
and consequently rectangular Cartesian axes do not exist. Neverthe- 
less, no difficulty is experienced in practice in defining such axes 
approximately and we shall suppose, therefore, that the coordinates 
x, y, Z, t of an event in the gravitational field about to be considered 
are interpreted physically as rectangular Cartesian coordinates and 
time. We shall now search for a metric which, when expressed in those 
coordinates, is form invariant with respect to the group of coordinate 
transformations which will be interpreted physically as rotations of 
the rectangular axes Oxyz (t is to remain unaltered). Such a metric 
will be said to be spherically symmetric about O. 

Invariants for this group of coordinate transformations, which are 
of degree no higher than the second in the coordinate differentials dx, 
dy, dz, are 

xe+y*+z2, xdxt+ydy+zdz, dx*+dy*+dz*. (51.1) 


Introducing spherical polar coordinates (r, 8,4), which will be defined 
by the equations (29.3), these invariants may be written 


r, rdr, dr’+rd0*+r* sin? 6d¢’. (51.2) 
It follows that r, dr, d0?+sin?0dd? (51.3) 


are invariants. The most general metric with spherical symmetry can 
now be built up in the form 


ds* = A(r,t) dr? + B(r, t) (d6? + sin? 6 dd) + 
+ C(r, t)dr dt + D(r, t) dt. | (51.4) 
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We now replace r by a new coordinate 7’ according to the transfor- 
mation equation 


r’? = B(r,t). (51.5) 
Then ds? = E(r’,t) dr’? + r'?(d0? + sin? 0d?) + 
+ F(r’, t)dr’ dt+ G(r’, 0) dt? (51.6) 


In a truly inertial frame, spherical polar coordinates can be defined 
exactly and the metric will, by equation (46.6), be expressed in the 
form 
ds* = dr* +-r°(d0* + sin? 0 dd”) — c? dt. (51.7) 
Comparing equations (51.6), (51.7), it is clear that, in a region for 
which (51.6) is the metric, 7’ will behave physically like a true spherical 
polar coordinate r. We shall accordingly drop the primes and write 
ds? = E(r,t) dr* +r°(d0* + sin? 6 dd”) + 
+ F(r,t)drdt+G(r, t) dt. (51.8) 
If our frame is quasi-inertial, equation (51.7) must be an approxi- 
mation for equation (51.8) and the following equations must therefore 
be true approximately: 
E(,t)=1, F(r,t) =0, G(r,t) = —c*. (51.9) 
Consider now the special case when the gravitational field is static 
in the quasi-inertial frame for which (r, 9, 6) are approximate spherical 
polar coordinates and ¢ is the time. The functions E, F, G will then be 
independent of t. Also, space-time will be symmetric as regards past 
and future senses of the time variable and this implies that ds* is 
unaltered when dt is replaced by — dt. Thus F = 0 and we have 


ds? = adr* +r7(d0* + sin? 0d”) — bc* dt”, (51.10) 
where a, b are functions of r both approximating unity. 
For this metric, taking 
xl=r, x*7=060, x =¢4, xt=1, (51.11) 
we have 
Su= 4, 2=7', 833 =Pr'sin*6, g4q = —be*, (51.12) 
all other g;, being zero. Thus 
g = —abc*r'sin? 6, (51.13) 
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and hence 
1 1 1 
1 7 33) 5. 
a’ © pe FP sin? 6” 
all other g being zero. The three-index symbols can now be calculated 
from equation (41.9). Those which do not vanish are listed below: 


1 
i a 1: 
g jou? (51-14) 


(iiy=—> 
{7} = O71} = a 


1 
{33} = G7} = -> 


r 
b’ 
es ed 
{1°43 = {471} ah 
P (51.15) 
{2'2} = sae 
{073} = {372} = cotd, 
{15} = —<sin?6, 
{,73} = —sin @cos8, 
cbt 
{aa = aa’ 


primes denoting differentiations with respect to r. 
Contracting Bi xt AS given by equation (37.21) with respect to the 
indices i, /, it follows that 


‘ Oo. o., 
R= I S-OU + axk {ji axl {kh 
= {,' Pits pa {j zzlos V(-g)+ 


eee fag slog y(- eee. es (51.16) 
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by equation (43.4) (g can clearly be replaced by —g in this equation 
without affecting its validity). The non-zero components of the Ricci 
tensor are now calculated to be as follows: 


Au = 35" 46? dab ar’ 
rb’ ra 1 
Ry» = 5-335 t-— lI, 
22 ab 2a" a (51.17) 


R33 = R22 sin” 0, 


Ry = (FF eS -2), 
2a 4ab 4a” ar 
Since R! = g!! Ri, RZ = 27? Ro», etc. (51.18) 
it will be found that 


br bY ab 2b da’ 2 2. 
ab 2ab* 2a*b abr a*r ar* fr’ 


R=R= (51.19) 


52. Schwarzschild’s solution 

The static, spherically-symmetrical, metric (51.10) will determine the 
gravitational field of a static distribution of matter also having 
spherical symmetry, provided it satisfies Einstein’s equations (48.15). 
We shall consider the special case when the whole of space is devoid 
of matter, apart from a spherical body with its centre at the centre of 
symmetry O. Then 7; = 0, T= 0 at all points outside the body and 
Einstein’s equations reduce in this region to 


4 = 0. (52.1) 


By equations (51.17), these are satisfied by the metric (51.10), provided 
a, b are such that 


ob 4 dah a ree! 
rb’ ra’ 1 
tb tee p22) 
” 72 oe , 
b” b ab’ b a (52.4) 


2a 4ab 4a* ar 
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From (52.2), (52.4), it follows that 
ab’+a’b=0 (52.5) 
and hence that ab = constant. (52.6) 


But, as 7 >, we shall assume that our metric approaches that given 
by equation (51.7) and valid in the absence of a gravitational field. 
Thus, as r->0o, then a—>-1, b->1 and hence 


ab=1, © (52.7) 
Eliminating b from equation (52.3), it will be found that 
ra’ = a(1—a). (52.8) 
This equation is easily integrated to yield 
a= sass . (52.9) 
1~(2m/r) 
where mm is a constant of integration. Then 
b= 1-—, (52.10) 


and it may be verified that each of the equations (52.2)-(52.4) is 
satisfied by these expressions for a and b. 
We have accordingly arrived at a metric 


ds* = +r°(d0* + sin? ag) — (1 -*") dt? (52.11) 


1—(2m/r) r 

which is spherically symmetrical and can represent the gravitational 
field outside a spherical body with its centre at the pole of spherical 
polar coordinates (7, 0,4). This was first obtained by Schwarzschild. 
It will be proved in the next section that the constant mis proportional 
to the mass of the body. This may also be deduced from equation 
(49.17), for the potential V at a distance r from a spherical body of 
mass M is given by 

yM 


= —— (52.12) 
r 
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and hence 844 = 1- ae (52.13) 
Now 24,4 is the coefficient of (dx*)? = —c*dt? in the metric and hence 
2yM 
b=1-2—. (52.14) 
cr 


Comparing equations (52.10), (52.14), it will be seen that 


M 
="). (52.15) 
Cc 


It is clear from equation (52.11) that the metric is singular at a 
distance 


r=2m= ar (52.16) 


from O. We conclude that the radius of the body must certainly be 
greater than this minimum value. Since, in c.g.s. units, c = 3 x 10!° 
and for the Earth yM = 3-991 x 107°, the minimum radius for this 
body is about 9 mm. 


53. Planetary orbits 


The attractions of the planets upon the Sun cause this body to have 
a small acceleration relative to an inertial frame. If, therefore, a co- 
ordinate frame moving with the Sun is constructed, relative' to this 
frame there will be a gravitational field corresponding to this accelera- 
tion in addition to that of the Sun and planets. However, for the pur- 
pose of the following analysis, this field and the fields of the planets 
will be neglected. Thus, relative to spherical polar coordinates having 
their pole at the centre of the Sun, the gravitational field will be 
assumed determined by the Schwarzschild metric (52.11). The planets 
will be treated as particles possessing negligible gravitational fields, 
whose world-lines are geodesics in space-time (section 47). We pro- 
ceed to calculate these geodesics. 
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Substituting for a, b from equations (52.9), (52.10) in equations 
(51.15), it will be found that 


(12) =, 
{1°} =, 
(223 (53.1) 


{2'2} = —(r—2m), 
{.°3} = cot}, 

{,'5} = —(r—2m)sin20 
{,73} = —sin@cos9, 


2 
{ala} = Sy (72m), 


Equations (44.5) for the geodesics accordingly take the form: 


d*r m dé 2,(4¢ 
ds* r(r— Grim) a 2m)(5) i (Ft ‘) - 


me2 (dt\2) 


d*@ 2drdé dp\? 
— x +-——-sin@cos6{[—} = ; 
gots Te de tn Pecos ( 4 0, (53.3) 
d*$ 2drdd d6 dé 
ads ea Fade Pe = 0, (53.4) 


d*t 2m drdt _ 


ge as Oe eae 


(53.5) 
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There is also available the first integral (44.6) of these equations 
which, by equation (52.11), is 


r_ [ar\? do dd c? dt\? 
—() +A((2) +sin?o(% }-<¢-2m(Z) = |, (53.6) 


Equation (53.2) will be discarded in favour of this first integral. 

We now choose the spherical polar coordinates so that the planet 
is moving initially in the plane 6 = 47. Then d0/ds = 0 initially and 
hence, by equation (53.3), d?6/ds* = 0 at this instant and the particle 
continues to move in this plane indefinitely. Thus, putting 0 = 47, 
d0/ds = 0 in the remaining geodesic equations, we find these reduce to 

d*¢ 2drdd 


deep ae de = 0, (53.7) 


d*t 2m drdt 
Se te ee ee ; 
ds” ss r(r—2m)dsds’ eo) 


cal) AR) Seale) 1 os 


Putting w= d¢/ds, v= dt/ds, equations (53.7), (53.8) may be 
written 


dw 2 
Yt w= 0, (53.10) 
dr r 
dv 2m 
=o = 0, 3.11 
Fa a Pe C20) 


respectively. These equations can now be integrated to yield 


a 
= 7.5, (53.12) 

s or 

dt Br 
pata, (53.13) 


where «, 8 are constants of integration. 
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Substituting for dd/ds, dt/ds from the last two equations into 
equation (53.9), it follows that 


dr\? 2m 
si —-(r—2m) = 2g2_ oT , : 
( 4 +3 (r—2m) = 1+c*B - (53.14) 


Then, eliminating ds between this equation and eqpation (53.12), we 
obtain the equation for the orbit, viz. 


adr\? a 2 2 

baer aust 57 = oo = | 
(55) +5 1+c*B - a (53.15) 
With u = 1/r, this reduces to the form 


du\? _ _1+0?f? 2m 3 
(5) tab = SE Bt aml, (53.16) 


Differentiating through with respect to ¢, this equation takes a form 
which is familiar in the theory of orbits, viz. 
du 
dd? 
The corresponding equation governing the orbit according to 
classical mechanics is 


+u = —5 43m. (53.17) 
a 


d7u M 
ptt roe (53.18) 


where M is the mass of the attracting body and h is the constant 
velocity moment of the planet about the centre of attraction, i.e., 


dd 
4 A: 19 
r , h (53.19) 


The general relativity counterpart of this last equation is clearly 
equation (53.12). If 7 is the proper time for the orbiting body, by 
equation (46.6) icdr = ds and equation (53.12) is equivalent to 

2a¢ 


r dr = ica. (53.20) 
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We shall identify the time variable t of classical mechanics with the 
proper time 7 for the body whose motion is being studied and then, 
by comparison of equations (53.19), (53.20), it is seen that 


h = ice. (53.21) 
Equation (53.17) then becomes 
du me? 2 
dg? +u= Re +3mu (53.22) 


which, apart from a term 3mu?, is identical with the classical equation 
(53.18) provided 


yM 
m=-7° 


c 


(53.23) 


This confirms equation (52.15). 
The ratio of the additional term 37mu’ to the ‘inverse square law’ 
term mc?/h? is 


s-=5r¢ (53.24) 


by equation (53.19). rp is the transverse component of the planet’s 
velocity and, for the planets of the solar system, takes its largest value 
in the case of Mercury, viz. 48 km/sec. Since c = 3 x 10° km/sec, the 
ratio of the terms is in this case 7-7 x 10~8, which is very small. 
However, the effect of the additional term proves to be cumulative, 
as will now be proved, and for this reason an observational check can 
be made. 
The solution of the classical equation (53.18), viz. 


pee tl +ecos(¢—&)}, (53.25) 


where . = yM = mc’, e is the eccentricity of the orbit and & is the 
longitude of perihelion, will be an approximate, though highly accur- 
ate, solution of equation (53.22). Hence the error involved in taking 


> 3m 


2 
3m = — 1 4ecos(¢—&) (53.26) 


h* 
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will be absolutely inappreciable, since this term is very small in any 
case. Equation (53.22) can peas be replaced by 

a 


d¢ 


This equation will possess a solution of the form (53.25) with ad- 
ditional ‘particular integral’ terms corresponding to the new term 
(53.26). These prove to be as follows: 


atu oe PackeeuG= &)}. (53.27) 


- 2 


te” cos 2(P—&) + edsin(A—@)}. (53.28) 


The constant term cannot be observationally separated from that 
already occurring in equation (53.25). The term in cos2(¢—) has 
amplitude too small for detection. However, the remaining term has 
an amplitude which increases with ¢ and its effect is accordingly 
cumulative. Adding this to the solution (53.25), we obtain 


Ss 


u= f{1-+ecos(— @)+ ¢sin (¢— ay} 


h2 
= pul +ecos(¢—&—85)}, (53.29) 


where 8 = 3mpd¢/h* and we have neglected terms O(8d7). 
Equation (53.29) indicates that the longitude of perihelion should 

steadily increase according to the ee 

o aie 


> = 3 $ 4 +4 (53.30) 


where / = h? |p is the semi-latus rectum of the orbit. Taking 
p = 1:33 x 107° 


c.g.s. units for the Sun, c = 3 x 10!° cm/sec and / = 5:79 x 10!? em for 
Mercury, it will be found that the predicted angular advance of 
perihelion per century for this planet’s orbit is 43”. This is in agree- 
ment with the observed value. The advances predicted for the other 
planets are too small to be observable at the present time. 
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54. Gravitational deflection of a light ray 

In section 7 it was shown that the proper time interval between the 
transmission of a light signal and its reception at a distant point is 
zero. It was there assumed that the signal was being propagated in an 
inertial frame and hence that no gravitational field was present. This 
result can be expressed by saying that 


ds = 0 (54.1) 


for any two neighbouring points on the world-line of a light signal. 
Now, null-geodesics in the space-time having metric (46.6) are defined 
by equation (54.1) and the equations 


ae gts. Hy 2; 
GX dy Oe et (54.2) 


for the three index symbols are all zero. Equations (54.2) imply that 
along a null-geodesic x, y, z are linearly dependent upon ¢. But this 
is certainly true for the coordinates of a light signal being propagated 
in an inertial frame. We conclude that the world-lines of light signals 
are null-geodesics in space-time. 

Since an inertial frame can always be found for a sufficiently small 
space-time region even in the presence of a gravitational field, it fol- 
lows that the world-line of a light signal in any such region is a null- 
geodesic. We shall accept the obvious generalization of this result, 
viz. that the world-lines of light sigaals over an unlimited region of 
space-time are null-geodesics. 

We shall now employ this principle to calculate the path of a light 
ray in the gravitational field of a spherical body. Taking the space- 
time metric in the Schwarzschild form (equation (52.11)), the three- 
index symbols are given at equation (53.1) and the equations govern- 
ing a null-geodesic (equations (44.14)) are identical with the equations 
(53.2)-(53.5) after s has been replaced by A. The first integral (44.13) 
takes the form 


r (dr\> ,((d6\? . ,,(dd\*\_ ¢* dt\? _ 
(9) +r (5) +sin 0(%) |< (-—2m)(5) = 0. 
( 


54.3) 
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Without loss of generality, we shall again put 9 = $7, so that a ray 
in the equatorial plane is being considered and then proceed exactly 
as in the last section to derive the equation 


d2 
pe u = 3mu’, (54.4) 


where u = 1/r. This equation determines the family of light rays in the 
equatorial plane. 

As a first approximation to the solution of equation (54.4), we shall 
neglect the right-hand member. Then 


u= 008 (p+), (54.5) 
where R, « are constants of integration. This is the polar equation of 
a straight line whose perpendicular distance from the centre of 
attraction is R. As might have been expected, therefore, provided the 
gravitational field is not too intense, the light rays will be straight lines. 
This deduction is, of course, confirmed by observation. Thus, as the 
Moon’s motion causes its disc to approach the position of a star on the 
celestial sphere and ultimately to occult this body, no appreciable 
deflection of the position of the star on the celestial sphere can be 
detected. 

Again, without loss of generality, we shall put « = 0 so that the 
light ray, as given by equation (54.5), is parallel to the y-axis 
(¢ = +47). Then, putting u = cos¢/R in the right-hand member of 
equation (54.4), this becomes 

d*u _ 3m 
dg? +u= R 


The additional ‘ particular integral’ term is now found to be 


cos? d. (54.6) 


aa (2— cos $) (54.7) 


and hence the second approximation to the polar equation of the 
light ray is 


1 m 
u = =cos p+ re (2—cos*¢). (54.8) 
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At each end of the ray u = 0 and hence 
2 
7008" d—cos¢— > = 0. (54.9) 


Assuming m/R to be small, this quadratic equation has a small root 
and a large root. The small root is approximately 


2m 
Siete 54.10 
cos d a ( ) 
wv 2m 
and hence @= + (+ =) (54.11) 


at the two ends of the ray. The angular deflection in the ray caused. 
by its passage through the gravitational field is accordingly 
4m 


= (54.12) 


approximately. 
For a light ray grazing the Sun’s surface, 


R = Sun’s radius = 6:95x10!°cm and m= 1-5x10°cm. 


Thus the predicted deflection is 8-62 x 10~° radians, or about 1-77’. 
This prediction has been checked by observing a star close to the 
Sun’s disc during a total eclipse. The experimental findings are in 
accord with the theoretical result. 


55. Gravitational displacement of spectral lines 

A standard clock will be taken to be any device which experiences a 
periodic motion, each cycle of which is indistinguishable from every 
other cycle. The passage of time between two events which occur in 
the neighbourhood of the clock is then measured by the number of 
cycles and fraction of a cycle which the device completes between 
these two instants. The clocks employed to determine the time co- 
ordinate ¢* of an event in section 46 were not, necessarily, standard 
clocks. Such coordinate clocks can have arbitrary variable rates, the 
only requirement being that, if A, B are two events in the vicinity of 
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a coordinate clock and B occurs after A, then the coordinate-time for 
B must be greater than the coordinate-time for A. 

Consider, then, a standard clock which is moving in any manner 
within some reference frame. Let A, B be the two events representing 
the commencement and termination of one clock cycle and let C, D 
be two events separated by another clock cycle. Since we are assuming 
that the clock cycles are indistinguishable, the geometrical relation- 
ship between A and Bin space-time must be identical with the relation- 
ship between C and D. It follows that the space-time intervals between 
A and B and between C and D are equal. Thus, every cycle of a stan- 
dard clock registers its advance along its world-line through a con- 
stant interval and, if ds is the interval between neighbouring points on 
this world-line, the quantity whose passage is registered by the clock 
will be 

( ds, (55.1) 


integrated along the world-line. I.e. a standard clock registers the 
passage of interval along its trajectory. 

Let x! (i = 1,2,3,4) be the coordinates of an event with respect to 
some space-time reference frame, x, x”, x? being interpreted physic- 
ally as spatial coordinates relative to a static frame and x*/ic as time. 
If a standard clock is at rest relative to this frame, for adjacent points 
on its world-line dx! = dx? = dx? = 0 and hence 


ds? = gy(dx*)? = —c? gygdt’, (55.2) 


where we have put x* = ict. The interval s measured by the clock is 
therefore related to t, the coordinate-time at the point (x!, x2, x), by 
the equation 


s=ic | V(e44) dt. (55.3) 


s is imaginary only because we have chosen to define the interval ds 
between two events in such a way that time-like intervals are imagin- 
ary. The standard clock can be graduated to register T = s/ic and then 
standard clock time T will be related to coordinate-time by the 
equation 


T= | V(gsa)at. (55.4) 
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In the special case of the coordinate frame employed in section 49 
which was stationary in a relatively weak static gravitational field, it 
was proved that g4, is given in terms of the Newtonian scalar potential 
V for the field by the approximate equation (49.17). Thus 


WV 1/2 
aT = (1424) dt (55.5) 


relates time intervals measured by a stationary standard clock and a 
coordinate-clock at a point in a gravitational field where the potential 
is V. Now, when it is emitting its characteristic spectrum, an atom is 
operating as a standard clock. If, therefore, two similar atoms are 
stationary at different positions in a static gravitational field and they 
emit their characteristic radiation, the two intervals corresponding to. 
the emission of one complete cycle of a certain spectral line by each 
atom will be identical. If dT is this standard-time interval, V;, V2 are 
the gravitational potentials at the atoms, and df), dt, are the periods 
of the complete cycles as measured in coordinate-time, then by 
equation (55.5) 


Vv. 1/2 UV. 1/2 
ar = (147%!) dt = (147%) dt, (55.6) 
2V\"/ 2V,\' 
and hence diy: deg = (14222) : (14224) . (55.7) 


Suppose that the radiation from the atoms is observed at some point 
P in the field. Let t, be the coordinate-time at one atom when a radi- 
ation wave crest is emitted and ¢, the corresponding time for the 
next crest. Let t7, t, be the respective coordinate-times of arrival of 
these crests at P. Since the field and coordinate frame are both static, 
the time delay between a crest leaving the atom and its arrival at P 
will be constant. Thus 

ti—ta = thts (55.8) 


or ti—th = tat. (55.9) 


This last equation shows that the period of vibration of the atom as 
measured by the coordinate-clock at P is independent of this point’s 
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position and is equal to the period as measured by the coordinate- 
clock at the atom itself. The frequencies v;, v. of corresponding spec- 
tral lines for the two atoms as measured at P will therefore, by 
equation (55.7), be in the ratio 


Vy 14+2V;/c? Vi—V2 
vy Fi (re Pioaee (32-10) 


approximately. 
In the case of an atom at the surface of the Sun and a similar atom 
at the Earth’s surface, it will be found that, in c.g.s. units, 


V, = —9-512x10!? (Earth), 
V>, = —1-914x10'* (Sun), 


H 


“ = 1-00000212. (55.11) 
2 


and thus 


This effect is so small, that it is very difficult to measure. However, in 
the case of the companion of Sirius, the predicted effect is 30 times 
larger and has been confirmed by observation. 


56. Maxwell’s equations in a gravitational field 

In this final section, the equations (23.11) determining the electro- 
magnetic field due to the motion in vacuo ofa distribution of electric 
charge, will be generalized to take account of any gravitational field 
which may be present, but we shall not elaborate upon the implications 
of the modified equations. 

Over any sufficiently small region of space and restricted interval 
of time it is possible to define a rectangular Cartesian inertial frame, 
i.e. the frame in ‘free fall’ in the gravitational field. If the electric and 
magnetic components of th> electromagnetic field are measured in 
this frame, the field tensor Fj; defined by equation (23.5) can be found. 
Employing the appropriate transformation equations, the com- 
ponents of this tensor relative to general coordinates x! in the gravi- 
tational field can be computed. No distinction is made between 
covariant and contravariant properties relative to the original in- 
ertial frame so that, when transforming, F;; may be treated as a 
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covariant, contravariant or mixed tensor. If it is treated as a covariant 
tensor, the covariant components F;; in the general x'-frame will be 
generated. If it is treated as a contravariant or as a mixed tensor, the 
contravariant or mixed components FY, F 4 respectively will be gener- 
ated. In this way, the field tensor is defined at every point of space- 
time. Similarly, a current density vector with covariant components 
J, and contravariant components J’ is defined relative to the x'-frame. 
Consider the equations 


FY = wal (56.1) 


Fij.4t Feit Fasj _ 0. (56.2) 


These are tensor equations and hence are valid in every space-time 
frame if they are valid in any one. But, relative to the inertial coordin- 
ate frame (x,y,z, ict) which can be found for any sufficiently small 
space-time region, these equations reduce to equations (23.11) and 
hence are valid over such a region. Regarding the whole of space-time 
as an aggregate of such small elements, it follows that equations (56.1), 


(56.2) are universally true. 
Since F” is skew-symmetric, 
: OF eg, 
FY s — gl TORE, 
aF! 1 
arta Sistem es Fr. 
x "We a “(W(-9)} 
1 
iG BY, 56.3 
= gan g) (56.3) 


by equation (43.4) (g has been replaced by —g, since g is always 
negative for a real gravitational field). Equation (56.1) is accordingly 
equivalent to 


1 
7 a(R —g)F}= ay (56.4) 


Further, since g is a relative invariant of weight 2 (cf. equation (32.7 )) 
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and hence +/(—g) is an invariant density, it follows that §”, 9 defined 
by the equations 


BY = V(-a) FY, 3 = V(-8)J'; (56.5) 
are densities and then equation (56.4) takes the simpler form 
oy ar, 
7 toe als as (56.6) 
x c 


Also, in view of the skew-symmetry of the field tensor, it follows 
that equation (56.2) is equivalent to 
OF; OF, OF xi 
axk © ax? © axi — i Ce) 
Exercises 6 
1. Prove that when an index of T” as defined by equation (48.2) is 
lowered, T; as defined by equation (48.3) is obtained. 
2. In the space-time whose metric is given by 


ds? = &4(dx*? + (dx!) + de? + ay, 

where ¢, @ are functions of x' only, prove that the Riemann- 
Christoffel tensor vanishes if and only if 

$’-0'd' +4” = 0 
where the dashes denote differentiations with respect to x!. If 6 = — 6, 
prove that the space is flat provided that 

¢ = $log(a+ bx!), 
where a, b are constants. 

(L.U.) 
3. If the metric of space-time is 
ds* = —eA(dx')* + (dx)?}— (xe Pdx?)? + eP(dx*, 

where A and p are functions of x! and x* only, show, by calculating 


Ry, that the field equations R; = 0 (for a region devoid of matter) 


require that p shall satisfy 
ep Pp + ep P 1 op =O 
(ax!)?  (ax2)? x7 ax?” 


(Li. U.) 
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4. Find the differential equations of the paths of test particles in 
the space-time of which the metric is 


ds? = & | — (dx? + dy’ + dz”) + dt’, 


where k is a constant. If 


v= ay’ (2) .(@) 
~ lat dt dt 


and if v = V when x = 0, show that 


1—v* = (1- VV) e7**, 
(Li. U.) 


5. Use the equations 
Ri-48!R = —xT! 


to find the energy-momentum tensor for the distribution of matter 
corresponding to the space-time 


ds* = —e*(dx* + dy* +dz*)+ dt’, 


where g is a function of t only. 


(Li. U.) 
6. If . 
gh = Nh ty? + ca? 
~ J—-kx c2 (1—kx)* 
where k is a constant, and if 
dx\? (dy\? [dz\? 
ov =(F) 4(7) +(F), 
at dt dt 
prove that, along a geodesic, 
V2-—v2 = kc’ x, 
where V is a constant. 
(L.U.) 


7. Show that the four differential equations (53.2)-(53.5) for the 
geodesics in the Schwarzschild space-time have a solution for which 
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6 = 40, r = a, where ais a constant greater than 3m, and that the total 
interval along this geodesic from ¢ = 0 to ¢ = 27 is 


1/2 
mia (< = ’ 
m 


Also show that there is a geodesic along which 6 = const.,¢ = const. 
and which satisfies an equation of the form 


dr\? 2 2m\—! 2m\? {1 1 
(7) =2"@(1-2) (1-7) (;-3)- 
where R is fixed. State briefly the physical interpretation of these 


results. 
(L.U.) 
8. A space-time has metric 


ds? = e?°{(dx!)? + (dx’)? + dx3)* + (dx*} 
where o is a function of (x!,x?,x°,x*). If ¢’ is the unit tangent to a 
geodesic, prove that 


dt* 
os +2(0;t') t* = o,e~”°, 


where Oo, = 


For slow motions in slowly changing fields, prove that the geodesics 
are paths of particles in a gravitational field of potential — ac’, 
(L.U). 
9. Find the Riemann-Christoffel tensor of the space-time of the 
last exercise, and prove that the scalar curvature R vanishes if, and 
only if, 
Cppt p90, = O 


ao 


where > 
PQ Ax? ax4 
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If o is a function of r = [(x')* + (x?)*+ (x3)?]!/? only, prove that this 
condition is 


” 2, 2 
sea es = 0, 


where dashes denote differentiations with respect to r. 


(L.U.) 


10. In the space-time of metric 
ds* = e?°{dr? +r? dé? + r* sin? 6. dd? — dt”} 


where o = log(i+m/r), and m is constant, prove that the scalar 
curvature is zero (see Exercise 9 for this condition). Prove that the 
geodesics satisfy the equations 


sin? 9% = k,e~7° 
ds 


where k,, k2 are constants. If we choose ¢ = 0, dd/ds = 0 initially, 
prove that ¢ is always zero, and that 
2 dé —_ —20e 
r i he’ ~°, 
where A is constant. 
(L. UV.) 
11. Show that 


42 = mola Leateatsa) 
where g is an arbitrary function and 
1 
o = P(x? +y? +27) 
c 


is invariant under a Lorentz transformation. 


If 


j' = px, P =py, P= pz, *= ot 
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where p is a function of o and j’ is a contravariant vector satisfying 


ra) 
at VC-8)F) = 0, (x! = Xe. x = t), 
show that 
Ae“ 
P= of’ 


where A is a constant. 
(L.U.) 
12. By replacing the spherical polar coordinate r occurring in the 
Schwarzschild metric (52.11) by a new coordinate r’ where 


r=r i : 
~ 2r’}’ 


obtain this metric in ‘isotropic’ form, viz. 


4 2 
m 1 —m/2r 
ds = {14+— d 2a 12 dQ? +. 12 gin? Od 2). ofS 2 dt. 
( yy) : eee a aioe) © 
13. Employing a certain frame, an event is specified by spatial 
coordinates (x,y,z) and a time ¢. The corresponding space-time 
manifold has metric 


ds? = dx2+dy2+dz2+ 2at dx dt—(c?— a? t?) dt?. 


Show that a particle falling freely in the gravitational field observed 
in the frame has equations of motion 


x= A+Bt—hat?, y= C+Dt, z= E+Ft, 


where A, B, C, D, E, Fare constants. By transforming to coordinates 
(x’, y,z,t), where x’ = x+4at?, and recalculating the metric, explain 
this result. 

14. (x1,x2,x3) are spatial coordinates of an event relative to a 
frame S and x‘ is the time of the event measured by a clock in S. A 
second frame J is falling freely in the neighbourhood of P and may 
be regarded as inertial. Oy! y2y3 are rectangular cartesian axes in J 
and y*/ic represents the time within J as measured by synchronised 
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clocks attached to the frame. Show that g,,, the metric tensor in S, is 
given by 


P is a point, fixed in S, having coordinates (x1, x2, x3). At the instant 
x4, Tis chosen so that P is instantaneously at rest in J. Deduce that 


ay* = 8i4 . 
ax! (844) 
dl is the distance between P and a neighbouring point 
P(x! + dx), x2 + dx2, x3 + dx3) 
as measured by a standard rod in J at the instant x4. Prove that 
dl? = dy%dy* = yy dxAdxt, 


where «, A, 4 range over the values 1, 2, 3 and 


848 4 
§44 


(Yr is the metric tensor for the #3 which is S at the instant x4.) 

15. Oxyz is a rectangular cartesian inertial frame J. A rigid disc 
rotates in the xy-plane about its centre O with angular velocity w. 
Polar coordinates (r, 9) in a frame R rotating with the disc are defined 
by the equations 


Yap = Sdn 


x =rcos(@+wt), y= rsin(@+wt), 


where ¢ is the time measured by synchronised clocks in the inertial 
frame. If the time of an event in R is taken to be the time shown by 
an adjacent clock in J, show that the space-time metric associated 
with R is 
ds? = dr2+r2d62+ 2wr? dO dt —(c2 —r? w?) dt?. 
Deduce that the metric for geometry in R is given by 
r2 d@2 


Oe ipl hohe 
dl r + utp 
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(Hint: employ the result of the previous exercise.) Hence show that 
the family of geodesics on the disc is determined by the equation 


8 = const.—sin™! (*)- = / (r2 ~ a), 
r}) r 
where r, = c/w and |a| < r,. Sketch this family. What is the physical 
significance of r,? 

16. x! (i= 1,2,3,4) are three space coordinates and time relative 
to a reference frame S. A test particle is momentarily at rest in S at 
the point (x1, x2, x3) at the time x4. If g,; is the metric tensor for the 
gravitational field in .S, write down the conditions that the world-line 
of the particle is a geodesic and deduce that 


ge od (oh Bae) ee, 
(dx4)2—-2\ ax! = gag x4] Ox4 
where the Greek index ranges over the values 1, 2, 3. Hence show that 
the covariant components of the particle’s acceleration in S are given 
by Q 
d2x aU dy 
ee ete he 1/27, 
YoB (axa)? ax fereu) ax4 
where y,g is defined in exercise 14 and 


844 = —(c2+2U), Ve = Saal V(— 848). 
(U, y, are the gravitational scalar and vector potentials respec- 
tively.) 

Show that, in the case of the space-time metric appropriate to the 
rotating frame of exercise 15, the gravitational vector potential 
vanishes and the scalar potential is given by U=4w?r2, Interpret 
this result in terms of the centrifugal force. 

17. De Sitter’s universe has metric 


ds2 = —A~1dr2—r2 d62—r2 sin? 0db2+ Ac? dt?, 
where A=1-—r2/R2, R being constant. Obtain the differential 


equations satisfied by the null-geodesics and show that along null- 
geodesics in the plane 6 = 47 


d 
«Fj = r(r2—a2)1/2, 
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where a is a constant. Deduce that, if r, ¢ are taken to be polar 
coordinates in this plane, the paths of light rays in this universe are 


straight lines. . 
18. Einstein’s universe has the metric 
1 
ds? = ¢c2dt2— dr2 — r2 d62 — r2 sin? 0 dd2, 
= 1—Ar2 : 6 


where (r,9,¢) are spherical polar coordinates. Obtain the equations 
governing the null-geodesics and show that, in the plane 6 = 47, 
these curves satisfy the equation 


dr \2 
—} = (1 —Ar?)(ur2—-1), 
(G) = Pa-aaGr—0 
where p is a constant. Putting r2 = 1/v, integrate this equation and 
hence deduce that the paths of light rays in the plane @ = 47 are the 


ellipses 
Ax?+ py? = 1, 


where (x,y) are rectangular cartesian coordinates. Show, also, that 
the time taken by a photon to make one complete circuit of an 
ellipse is 27/(cA!/2), 

19. If the metric of space-time is 


ds? = kadt? — «*(dx? + dy? + dz?), 


where « is a function of x alone and k is a constant, obtain the 
differential equations governing the world-lines of freely falling 
particles. If x,y,z are interpreted as rectangular cartesian co- 
ordinates by an observer and ¢ is his time variable, show that there is 
an energy equation for the particles in the form 


k 
4y2 — ae constant. 
a 


20. Explain why equation (23.6) remains valid in a gravitational 
field. 

21. (r,9,¢,t) are interpreted as spherical polar coordinates and 
time. A gravitational field is caused by a point electric charge at the 
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pole. Assuming that the space-time metric is given by equation 
(51.10) and that the 4-vector potential for the electromagnetic field 
of the charge is given by 2 = (0,0,0, x), where x = y(r), calculate the 
covariant components of the field tensor F;; from equation (23.6) and 
deduce the contravariant components F¥. Assuming that J'=0, 
prove that Maxwell’s equations are all satisfied if 


dx e 
Ee ne 
; 72°¢ / (ab), 


where e is a constant. 
Calculate the elements of the mixed energy-momentum tensor 
from the equation 


and write down Einstein’s equations for the gravitational field. Show 
that these are satisfied provided 


where m is a constant. 
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